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GFCs AS MODIFIERS OF THE IRRTK AND p21 PATHWAYS AND METHODS 

OF USE 

REFERENCE TO RELATED APPLICATIONS 

5 This application claims priority to U.S. provisional patent applications 60/305,016 

~m&&WkimQh and 60/328,567 f i l e d 1 0 /10/2001. Th e con t en t s of-4h& prior applications 
are hereby incorporated in their entirety. 

BACKGROUND OF THE INVENTION 

10 Signal transduction pathways are made up of growth factors, their receptors, upstream 

regulators of the growth factors, and downstream intracellular kinase networks. These 
pathways regulate and play crucial roles in many cellular processes, such as metabolism 
and proliferation. 

In humans, there are three members of the Insulin Receptor family of receptor tyrosine 

15 kinases (IRRTK): Insulin Receptor (InsR), Tnsulin like growth factor receptor (IGFR), and 
insulin receptor related receptor (IRR). 

The insulin receptor (InR) binds insulin, the major anabolic hormone in humans, and 
through subsequent receptor and substrate phosphorylation, elicits a pleiotropic response, 
activating multiple signaling pathways for metabolic (carbohydrate, lipid, and protein 

20 processing), and growth (cell proliferation and differentiation) control (Smith RM, et al., 
IntRev Cytol 1997;173:243-80). 

The type 1 insulin-like growth factor receptor (IGF-1R), a transmembrane tyrosine 
kinase, is widely expressed across many cell types in fetal and postnatal tissues, including 
motor and sensory neurons and glial cells. Activation of the receptor following binding of 

25 the secreted growth factor ligands IGF-1 and IGF-2 elicits a repertoire of cellular 

responses including proliferation, and the protection of cells from programmed cell death 
or apoptosis. As a result, signaling through the IGF-1R is the principal pathway 
responsible for somatic growth. Emerging evidence suggests that members of the IGF 
family, including IGF-1, IGF-2, the IGF-1 receptor (IGF-1R), and the IGF binding 

30 proteins (IGFBPs), also play important roles in the development and progression of 

cancer. Both in vitro and in vivo studies show that IGFs are strong mitogens for a variety 
of cancer cells. IGF-1 also has an antiapoptotic action on cancer. IGF-1R, overexpressed 
in cancer cells, mediates the effects of IGFs and plays a role in cell transformation induced 
by tumor virus and oncogene products (Yu H, Berkel H . J La State Med Soc 1999 
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Apr;151(4):218-23). Accordingly, genes identified in the IGFR growth signaling pathway 
are of interest as targets for proliferation, apoptosis, and neuronal survival. 

Insulin receptor-related receptor (IRR) is an orphan receptor in the insulin receptor 
(IR) family of receptor tyrosine kinases, and is primarily localized to neural crest-derived 
5 sensory neurons during embryonic development where it might play a key role in neuronal 

sutvivaLIn adults, ids also expressed in pancreatic beta cells. IRR has no known ligands 

(Hirayama I , et al., Diabetes 1999 Jun;48 (6): 1237-44; Tsujimoto K, et al., Neurosci Lett. 
1995 Mar 24; 188(2): 105-8; Reinhardt RR, et al., J Neurosci. 1994 Aug;14(8):4674-83; 
Reinhardt RR, et al, Endocrinology. 1993 Jul;133(l):3-10). 

10 In the Drosophila there is only one member of the IRRTKs, known as InR, thus 

subserving the function of its three human homologues. Interestingly, in Drosophila the 
insulin receptor can drive proliferation if expressed in undifferentiated tissue. 

The p21/CDKNlAVAFl/CIPl protein(El-Deiry, W. S.; et al. Cell 75: 817-825, 1993; 
Harper, J. W.; et al. Cell 75: 805-816, 1993; Huppi, Ket al. Oncogene 9: 3017-3020, 1994) 

15 is a cell cycle control protein that inhibits cyclin-kinase activity, is tightly regulated at the 
transcriptional level by p21, and mediates p21 suppression of tumor cell growth. Along 
with p21, p21 appears to be essential for maintaining the G2 checkpoint in human cells 
(Bunz, R; Dutriaux, A.; et al. Science 282:1497-1501, 1998). Sequences of P21 are well- 
conserved throughout evolution, and have been identified in species as diverse as human 

20 (Genbank Identifier 13643057), Drosophila mclanogaster (GI# 168491 1), Caenorhabditis 
elegans (GI#4966283), and yeast (GI#2656016). 

Glypicans (GPC) are proteins with very characteristic structures that are substituted 
with heparan sulfate and that are linked to the cell surface via 

glycosylphosphatidylinositol. The modular structure of the glypicans has been highly 
25 conserved throughout evolution. Six glypicans have been identified so far in vertebrates. 
Mutations in Drosophila, humans and mice reveal a role for these cell surface molecules in 
the control of cell growth, migration, and differentiation (De Cat B, and David G. Semin 
Cell Dev Biol 2001 Apr; 12(2): 117-25; Higashiyama S et al (1993) I Cell Biol. 122:933- 
940). 

30 Many GPCs play important roles in various disease conditions. GPC1 regulates 

growth factor action in pancreatic carcinoma cells and is overexpressed in human 
pancreatic cancer, and its expression in this cancer may enhance tumori genie potential in 
vivo (Kleeff, J., et al. (1999) Pancreas 19:281-8; Kleeff, J., et al. (1998) I Clin Invest 
102:1662-73; WO200023109). Levels of GPC1 are also increased in breast cacer 
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(WO200023109). GPC3 is expressed in hepatocarcinoma and hepatoma, and is 
downregulated in mesothelioma primary tumor (Hsu, H. C, et al. (1997) Cancer Res 
57:5179-84; Murthy, S. S., et al. (2000) Oncogene 19:410-6). Mutations in GPC3 are 
associated with Simpson-Golabi-Behmel syndrome (Pilia, G., et al. (1996) Nat Genet 12, 
5 241-7). GPC4 is expressed in various cell lines including mammary gland and breast 
-tortef-ed^feresv-a n d varian t f o rms o f GPC 4 coexisting -^ylth-GP€§ variant forms may 
explain phenotypic variations in individuals displaying Sirnpson-Golabi-Behmel syndrome 
(Veugelers, M., et al. (1998) Genomics 53:1-11). GPC4 may be involved in diseases of 
abnormal cell growth and behavior (W099/37764). 

10 The ability to manipulate the genomes of model organisms such as Drosophila 
provides a powerful means to analyze biochemical processes that, due to significant 
evolutionary conservation, have direct relevance to more complex vertebrate organisms. 
Due to a high level of gene and pathway conservation, the strong similarity of cellular 
processes, and the functional conservation of genes between these model organisms and 

15 mammals, identification of the involvement of novel genes in particular pathways and 

their functions in such model organisms can directly contribute to the understanding of the 
correlative pathways and methods of modulating them in mammals (see, for example, 
MechlerBMetal., 1985EMBOJ 4:1551-1557; GateffE. 1982 Adv. Cancer Res. 37: 33- 
74; Watson KL., et al., 1994 J Cell Sci. 18: 19-33; Miklos GL, and Rubin GM. 1996 Cell 

20 86:521-529; Wassarman DA, et al., 1995 Curr Opin Gen Dev 5: 44-50; and Booth DR. 
1999 Cancer Metastasis Rev. 18:261-284). For example, a genetic screen can be carried 
out in an invertebrate model organism having underexpression (e.g. knockout) or 
overexpression of a gene (referred to as a "genetic entry point") that yields a visible 
phenotype. Additional genes are mutated in a random or targeted manner. When a gene 

25 mutation changes the original phenotype caused by the mutation in the genetic entry point, 
the gene is identified as a "modifier" involved in the same or overlapping pathway as the 
genetic entry point. When the genetic entry point is an ortholog of a human gene 
implicated in a disease pathway, such as IRRTK or p21, modifier genes can be identified 
that may be attractive candidate targets for novel therapeutics. 

30 All references cited herein, including sequence information in referenced Genbank 

identifier numbers and website references, are incorporated herein in their entireties. 
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SUMMARY OF THE INVENTION 

We have discovered genes that modify the IRRTK and p21 pathways in and 
Drosophila, and identified their human orthologs, hereinafter referred to as GPC. The 
invention provides methods for utilizing these IRRTK or p21 modifier genes and 
5 polypeptides to identify GPC-modulating agents that are candidate therapeutic agents that 
eaa-be-ased4a- tfae treatment of disorders as s ociated with defeG 4ve-er4atgaged IRRTK or 
p21 function and/or GPC function. Preferred GPC-modulating agents specifically bind to 
GPC polypeptides and restore IRRTK or p21 function. Other preferred GPC-modulating 
agents are nucleic acid modulators such as antisense oligomers and RNAi that repress 

10 GPC gene expression or product activity by, for example, binding to and inhibiting the 
respective nucleic acid (i.e. DNA. or rnRNA). 

GPC modulating agents may be evaluated by any convenient in vitro or in vivo assay 
for molecular interaction with a GPC polypeptide or nucleic acid. In one embodiment, 
candidate GPC modulating agents are tested with an assay system comprising a GPC 

15 polypeptide or nucleic acid. Agents that produce a change in the activity of the assay 
system relative to controls are identified as candidate IRRTK or p21 modulating agents. 
The assay system may be cell-based or cell-free. GPC-modulating agents include GPC 
related proteins (e.g. dominant negative mutants, and biotherapeutics); GPC-specific 
antibodies; GPC-specific antisense oligomers and other nucleic acid modulators; and 

20 chemical agents that specifically bind to or interact with GPC or compete with GPC 

binding partner (e.g. by binding to a GPC binding partner). In one specific embodiment, a 
small molecule modulator is identified using a binding assay. In specific embodiments, 
the screening assay system is selected from an apoptosis assay, a cell proliferation assay, 
an angiogenesis assay, and a hypoxic induction assay. 

25 In another embodiment, candidate IRRTK or p21 pathways modulating agents are 

further tested using a second assay system that detects changes in the IRRTK or p21 
pathways, such as angiogenic, apoptotic, or cell proliferation changes produced by the 
originally identified candidate agent or an agent derived from the original agent. The 
second assay system may use cultured cells or non-human animals. In specific 

30 embodiments, the secondary assay system uses non-human animals, including animals 

predetermined to have a disease or disorder implicating the IRRTK or p21 pathways, such 
as an angiogenic, apoptotic, or cell proliferation disorder (e.g. cancer). 

The invention further provides methods for modulating the GPC function and/or the 
IRRTK or p21 pathways in a mammalian cell by contacting the mammalian cell with an 
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agent that specifically binds a GPC polypeptide or nucleic acid. The agent may be a small 
molecule modulator, a nucleic acid modulator, or an antibody and may be administered to 
a mammalian animal predetermined to have a pathology associated the IRRTK or p21 

pathways. 

5 

»ETAItEB4)ESGRIFHON^^fflE INVENTION 
Genetic screens were designed to identify modifiers of the IRRTK or p21 pathways in 
Drosophila. For IRRTK, the dominant negative form of InR was expressed in the eye, 
resulting in a small eye phenotype. For p21, human p21 gene was overexpressed in the 

10 eye, resulting in a small, rough eye phenotype. Both screens aimed to identify enhancers 
or suppressors of the eye phenotype. The Dally gene was identified as a modifier of both 
IRRTK and p21pathways. Accordingly, vertebrate orthologs of these modifiers, and 
preferably the human orthologs, glypican (GPC) genes (i.e., nucleic acids and 
polypeptides) are attractive drug targets for the treatment of pathologies associated with a 

15 defective IRRTK or p21signaling pathway, such as cancer. 

In vitro and in vivo methods of assessing GPC function are provided herein. 
Modulation of the GPC or their respective binding partners is useful for understanding the 
association of the IRRTK or p21 pathways and its members in normal and disease 
conditions and for developing diagnostics and therapeutic modalities for IRRTK or p21 

20 related pathologies. GPC-modulating agents that act by inhibiting or enhancing GPC 
expression, directly or indirectly, for example, by affecting a GPC function such as 
binding activity, can be identified using methods provided herein. GPC modulating agents 
are useful in diagnosis, therapy and pharmaceutical development. 

25 Nucleic acids and polypeptides of the invention 

Sequences related to GPC nucleic acids and polypeptides that can be used in the 
invention are disclosed in Genbank (referenced by Genbank identifier (GI) number) as 
Gl#s 4504080 (SEQ ID NOT), 18567116 (SEQ ID NO:3), 13632290 (SEQ ID NO:4), 
13632295 (SEQ ID NO:5), 4504082 (SEQ ID NO:6), 5360214 (SEQ ID NO:8), 3015541 
30 (SEQ ID NO:9), 4877642 (SEQ ID NO: 10), and 8051601 (SEQ ID NOT 1) for nucleic 
acid, and GI#s 4504081 (SEQ ID NO: 13), 1708021 (SEQ ID NO: 14), 13632291 (SEQ ID 
NO: 16), 4758462 (SEQ ID NO: 17), 11421168 (SEQ ID NO:18), 4504083 (SEQ ID 
NO: 19), 4758464 (SEQ ID NO:20), 9973298 (SEQ ID NO:21) and 5031719 (SEQ ID 
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NO:22) for polypeptides. Additionally, nucleic acid sequences of SEQ ID NOs: 2, 7, 12, 
23 and 24 amino acid sequence of SEQ ED NO: 15 can be used in the invention. 

GPCs are glycosylphosphatidylinositol-anchored cell surface heparan sulfate 
proteoglycan proteins with glypican domains. The term "GPC polypeptide" refers to a 
5 full-length GPC protein or a functionally active fragment or derivative thereof. A 

-''fttaeti0Bftlly-active' ? -GPC fragment or derivativ e exhibits one or mo m^aae-fional activities 
associated with a full-length, wild-type GPC protein, such as antigenic or immunogenic 
activity, ability to bind natural cellular substrates, etc. The functional activity of GPC 
proteins, derivatives and fragments can be assayed by various methods known to one 

10 skilled in the art (Current Protocols in Protein Science (1998) Coligan et al., eds,, John 
Wiley & Sons, Inc., Somerset, New Jersey) and as further discussed below. For purposes 
herein, functionally active fragments also include those fragments that comprise one or 
more structural domains of a GPC, such as a binding domain. Protein domains can be 
identified using the PFAM program (Bateman A., et al., Nucleic Acids Res, 1999, 27:260- 

15 2). For example, the glypican domains (PFAM 01 153) of GPC from GI#s4504081, 
4758462, 4504083, 4758464, and 5031719 (SEQ ID NOs: 13, 17, 19, 20, and 22, 
respectively) are located at approximately amino acid residues 2 to 557, 4 to 578, 1 to 555, 
2 to 572, and 7 to 554, respectively. Methods for obtaining GPC polypeptides are also 
further described below. In some embodiments, preferred fragments are functionally 

20 active, domain-containing fragments comprising at least 25 contiguous amino acids, 
preferably at least 50, more preferably 75, and most preferably at least 100 contiguous 
amino acids of any one of SEQ ED NOs: 13, 14, 15, 16, 17, 18, 19,20, 21, or 22 (a GPC). 
In further preferred embodiments, the fragment comprises the entire glypican (functionally 
active) domain. 

25 The term "GPC nucleic acid" refers to a DNA or RNA molecule that encodes a GPC 

polypeptide. Preferably, the GPC polypeptide or nucleic acid or fragment thereof is from 
a human, but can also be an ortholog, or derivative thereof with at least 70% sequence 
identity, preferably at least 80%, more preferably 85%, still more preferably 90%, and 
most preferably at least 95% sequence identity with GPC. Normally, orthologs in 

30 different species retain the same function, due to presence of one or more protein motifs 
and/or 3-dimensional structures. Orthologs are generally identified by sequence homology 
analysis, such as BLAST analysis, usually using protein bait sequences. Sequences are 
assigned as a potential ortholog if the best hit sequence from the forward BLAST result 
retrieves the original query sequence in the reverse BLAST (Huynen MA and Bork P, 
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Proc Natl Acad Sci (1998) 95:5849-5856; Huynen MA et al, Genome Research (2000) 
10: 1204-1210). Programs for multiple sequence alignment, such as CLUSTAL 
(Thompson ID et al, 1994, Nucleic Acids Res 22:4673-4680) may be used to highlight 
conserved regions and/or residues of orthologous proteins and to generate phylogenetic 
5 trees. In a phylogenetic tree representing multiple homologous sequences from diverse 
species (e-.g; , relie ved ^{mghSLAST-aBaiysis^r^FA&kigem sequences from tw& species 
generally appear closest on the tree with respect to all other sequences from these two 
species. Structural threading or other analysis of protein folding (e.g., using software by 
ProCeryon, Biosciences, Salzburg, Austria) may also identify potential orthologs. In 

10 evolution, when a gene duplication event follows speciation, a single gene in one species, 
such as Drosophila, may correspond to multiple genes (paraiogs) in another, such as 
human. As used herein, the term "orthologs" encompasses paraiogs. As used herein, 
"percent (%) sequence identity" with respect to a subject sequence, or a specified portion 
of a subject sequence, is defined as the percentage of nucleotides or amino acids in the 

15 candidate derivative sequence identical with the nucleotides or amino acids in the subject 
sequence (or specified portion thereof), after aligning the sequences and introducing gaps, 
if necessary to achieve the maximum percent sequence identity, as generated by the 
program WU-BLAST-2.0al9 (Altschul et al, J. Mol. Biol. (1997) 215:403-410) with all 
the search parameters set to default values. The HSP S and HSP S2 parameters are 

20 dynamic values and are established by the program itself depending upon the composition 
of. the particular sequence and composition of the particular database against which the 
sequence of interest is being searched. A % identity value is determined by the number of 
matching identical nucleotides or amino acids divided by the sequence length for which 
the percent identity is being reported. "Percent (%) amino acid sequence similarity" is 

25 determined by doing the same calculation as for determining % amino acid sequence 

identity, but including conservative amino acid substitutions in addition to identical amino 
acids in the computation. 

A conservative amino acid substitution is one in which an amino acid is substituted for 
another amino acid having similar properties such that the folding or activity of the protein 

30 is not significantly affected. Aromatic amino acids that can be substituted for each other 
are phenylalanine, tryptophan, and tyrosine; interchangeable hydrophobic amino acids are 
leucine, isoleucine, methionine, and valine; interchangeable polar amino acids are 
glutamine and asparagine; interchangeable basic amino acids are arginine, lysine and 
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histidine; interchangeable acidic amino acids are aspartic acid and glutamic acid; and 
interchangeable small amino acids are alanine, serine, threonine, cysteine and glycine. 

Alternatively, an alignment for nucleic acid sequences is provided by the local 
homology algorithm of Smith and Waterman (Smith and Waterman, 1.981, Advances in 
5 Applied Mathematics 2:482-489; database: European Bioinformatics Institute; Smith and 
Waterman, 1981, J. &f ^ MeteevBid^44?vl^5-49?t^iehefas et a l., 199&, "A Tutorial on 
Searching Sequence Databases and Sequence Scoring Methods" (www.psc.edu) and 
references cited therein.; W.R. Pearson, 1991, Genomics 11:635-650). This algorithm can 
be applied to amino acid sequences by using the scoring matrix developed by Dayhoff 

10 (Dayhoff: Atlas of Protein Sequences and Structure, M. O. Dayhoff ed., 5 suppl. 3:353- 
358, National Biomedical Research Foundation, Washington, D.C., USA), and normalized 
byGribskov (Gribskov 1986 Nucl. Acids Res. 14(6):6745-6763). The Smith-Waterman 
algorithm may be employed where default parameters are used for scoring (for example, 
gap open penalty of 12, gap extension penalty of two). From the data generated, the 

15 "Match" value reflects "sequence identity." 

Derivative nucleic acid molecules of the subject nucleic acid molecules include 
sequences that hybridize to the nucleic acid sequence of any of SEQ ID NOs:l, 2, 3, 4, 5, 
6, 7 ,8 ,9 10, 1 1, 12, 23 or 24. The stringency of hybridization can be controlled by 
temperature, ionic strength, pH, and the presence of denaturing agents such as formamide 

20 during hybridization and washing. Conditions routinely used are set out in readily 

available procedure texts {e.g., Current Protocol in Molecular Biology, Vol. 1, Chap. 2.10, 
John Wiley & Sons, Publishers (1994); Sambrook et ah, Molecular Cloning, Cold Spring 
Harbor (1989)). In some embodiments, a nucleic acid molecule of the invention is capable 
of hybridizing to a nucleic acid molecule containing the nucleotide sequence of any one of 

25 SEQ ID NOs: 1, 2, 3, ,4 ,5 ,6 ,7 ,8 ,9 10, 11, or 12 under stringent hybridization conditions 
that comprise: prehybridization of filters containing nucleic acid for 8 hours to overnight 
at 65° C in a solution comprising 6X single strength citrate (SSC) (IX SSC is 0.15 M 
NaCl, 0.015 M Na citrate; pH 7.0), 5X Denhardt's solution, 0.05% sodium pyrophosphate 
and 100 /ig/ml herring sperm DNA; hybridization for 18-20 hours at 65° C in a solution 

30 containing 6X SSC, IX Denhardt's solution, 100 tig/ml yeast tRNA and 0.05% sodium 
pyrophosphate; and washing of filters at 65° C for lh in a solution containing 0.2X SSC 
and 0.1% SDS (sodium dodecyl sulfate). 

In other embodiments, moderately stringent hybridization conditions are used that 
comprise: pretreatment of filters containing nucleic acid for 6 h at 40° C in a solution 
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containing 35% formamide, 5X SSC, 50 mM Tris-HCl (pH7.5), 5mM EDTA, 0.1% PVP, 
0.1% Ficoll, 1 % BSA, and 500 /xg/ml denatured salmon sperm DNA; hybridization for 
1 8-20h at 40° C in a solution containing 35% formamide, 5X SSC, 50 mM Tris-HCl 
(pH7.5), 5mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100 /ig/ml salmon sperm 
5 DNA, and 10% (wt/vol) dextran sulfate; followed by washing twice for 1 hour at 55° C in 
a solution containing 2X SSC-afld-Orl^-SBS: — 

Alternatively, low stringency conditions can be used that comprise: incubation for 8 
hours to overnight at 37° C in a solution comprising 20% formamide, 5 x SSC, 50 mM 
sodium phosphate (pH 7.6), 5X Denhardt's solution, 10% dextran sulfate, and 20 jug/ml 
10 denatured sheared salmon sperm DNA; hybridization in the same buffer for 18 to 20 
hours; and washing of filters in 1 x SSC at about 37° C for 1 hour. 

Isolation, Production, Expression, and Mis-expression of GPC Nucleic Acids and 
Polypeptides 

15 GPC nucleic acids and polypeptides, useful for identifying and testing agents that 

modulate GPC function and for other applications related to the involvement of GPC in 
the IRRTK or p21 pathways. GPC nucleic acids and derivatives and orthologs thereof 
may be obtained using any available method. For instance, techniques for isolating cDNA 
or genomic DNA sequences of interest by screening DNA libraries or by using polymerase 

20 chain reaction (PCR) are well known in the art. In general, the particular use for the 
protein will dictate the particulars of expression, production, and purification methods. 
For instance, production of proteins for use in screening for modulating agents may 
require methods that preserve specific biological activities of these proteins, whereas 
production of proteins for antibody generation may require structural integrity of particular 

25 epitopes. Expression of proteins to be purified for screening or antibody production may 
require the addition of specific tags (e.g., generation of fusion proteins). Overexpression 
of a GPC protein for assays used to assess GPC function, such as involvement in cell cycle 
regulation or hypoxic response, may require expression in eukaryotic cell lines capable of 
these cellular activities. Techniques for the expression, production, and purification of 

30 proteins are well known in the art; any suitable means therefore may be used (e.g., Higgins 
SJ and Hames BD (eds.) Protein Expression: A Practical Approach, Oxford University 
Press Inc., New York 1999; Stanbury PF et al., Principles of Fermentation Technology, 2 nd 
edition, Elsevier Science, New York, 1995; Doonan S (ed.) Protein Purification Protocols, 
Humana Press, New Jersey, 1996; Coligan JE et al, Current Protocols in Protein Science 
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(eds.), 1999, John Wiley & Sons, New York). In particular embodiments, recombinant 
GPC is expressed in a cell line known to have defective p21 function such as HCT1 16 
colon cancer cells available from American Type Culture Collection (ATCC), Manassas, 
VA). The recombinant cells are used in cell-based screening assay systems of the 
5 invention, as described further below. 
Tfe nuek^ide seque^^ 
appropriate expression vector. The necessary transcriptional and translational signals, 
including promoter/enhancer element, can derive from the native GPC gene and/or its 
flanking regions or can be heterologous. A variety of host-vector expression systems may 

10 be utilized, such as mammalian cell systems infected with virus (e.g. vaccinia virus, 
adenovirus, etc.); insect cell systems infected with virus (e.g. baculovirus); 
microorganisms such as yeast containing yeast vectors, or bacteria transformed with 
bacteriophage, plasmid, or cosmid DNA. A host cell strain that modulates the expression 
of, modifies, and/or specifically processes the gene product may be used. 

15 To detect expression of the GPC gene product, the expression vector can comprise a 

promoter operably linked to a GPC gene nucleic acid, one or more origins of replication, 
and, one or more selectable markers (e.g. thymidine kinase activity, resistance to 
antibiotics, etc.). Alternatively, recombinant expression vectors can be identified by 
assaying for the expression of the GPC gene product based on the physical or functional 

20 properties of the GPC protein in in vitro assay systems (e.g. immunoassays). 

The GPC protein, fragment, or derivative may be optionally expressed as a fusion, or 
chimeric protein product (i.e. it is joined via a peptide bond to a heterologous protein 
sequence of a different protein), for example to facilitate purification or detection. A 
chimeric product can be made by ligating the appropriate nucleic acid sequences encoding 

25 the desired amino acid sequences to each other using standard methods and expressing the 
chimeric product. A chimeric product may also be made by protein synthetic techniques, 
e.g. by use of a peptide synthesizer (Ilunkapiller et al, Nature (1984) 310:105-11 1). 

Once a recombinant cell that expresses the GPC gene sequence is identified, the gene 
product can be isolated and purified using standard methods (e.g. ion exchange, affinity, 

30 and gel exclusion chromatography; centrifugation; differential solubility; electrophoresis). 
Alternatively, native GPC proteins can be purified from natural sources, by standard 
methods (e.g. immunoaffinity purification). Once a protein is obtained, it may be 
quantified and its activity measured by appropriate methods, such as immunoassay, 
bioassay, or other measurements of physical properties, such as crystallography. 
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The methods of this invention may also use cells that have been engineered for altered 
expression (mis-expression) of GPC or other genes associated with the IRRTK or p21 
pathways. As used herein, mis-expression encompasses ectopic expression, over- 
expression, under-expression, and non -expression (e.g. by gene knock-out or blocking 
5 expression that would otherwise normally occur). 



Genetically modified animals 

Animal models that have been genetically modified to alter GPC expression may be 
used in in vivo assays to test for activity of a candidate IRRTK or p21 modulating agent, 

10 or to further assess the role of GPC in a IRRTK or p21 pathways process such as apoptosis 
or cell proliferation. Preferably, the altered GPC expression results in a detectable 
phenotype, such as decreased or increased levels of cell proliferation, angiogenesis, or 
apoptosis compared to control animals having normal GPC expression. The genetically 
modified animal may additionally have altered IRRTK or p21 expression (e.g. IRRTK or 

15 p21 knockout). Preferred genetically modified animals are mammals such as primates, 
rodents (preferably mice), cows, horses, goats, sheep, pigs, dogs and cats. Preferred non- 
mammalian species include zebrafish, C. elegans, and Drosophila. Preferred genetically 
modified animals are transgenic animals having a heterologous nucleic acid sequence 
present as an extrachromosomal element in a portion of its cells, i.e. mosaic animals (see, 

20 for example, techniques described by Jakobovits, 1994, Curr. Biol. 4:761-763.) or stably 
integrated into its germ line DNA (i.e., in the genomic sequence of most or all of its cells). 
Heterologous nucleic acid is introduced into the germ line of such transgenic animals by 
genetic manipulation of, for example, embryos or embryonic stem cells of the host animal. 
Methods of making transgenic animals are well-known in the art (for transgenic mice 

25 see Brinster et al„ Proc. Nat. Acad. Sci. USA 82: 4438-4442 (1985), U.S. Pat. Nos. 

4,736,866 and 4,870,009, both by Leder et al., U.S. Pat. No. 4,873,191 by Wagner et al., 
and Hogan, B., Manipulating the Mouse Embryo, Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, N.Y., (1986); for particle bombardment see U.S. Pat. No., 4,945,050, 
by Sandford et al; for transgenic Drosophila see Rubin and Spradling, Science (1982) 

30 218:348-53 and U.S. Pat. No. 4,670,388; for transgenic insects see Berghammer A.J. et 
al., A Universal Marker for Transgenic Insects (1999) Nature 402:370-371; for transgenic 
Zebrafish see Lin S., Transgenic Zebrafish, Methods Mol Biol. (2000); 136:375-3830); for 
microinjection procedures for fish, amphibian eggs and birds see Houdebine and 
Chourrout, Experientia (1991) 47:897-905; for transgenic rats see Hammer et al, Cell 

11 



WO 03/0066 1 1 



PCT/US02/21694 



(1990) 63:1099-1 1 12; and for culturing of embryonic stern (ES) cells and the subsequent 
production of transgenic animals by the introduction of DNA into ES cells using methods 
such as electroporation, calcium phosphate/DNA precipitation and direct injection sec, 
e.g., Teratocarcinomas and Embryonic Stem Cells, A Practical Approach, E. J. Robertson, 
5 ed., IRL Press (1987)). Clones of the nonhuman transgenic animals can be produced 

aeeerdffig-to-av&rlable m e thods (see Wilmut, I. et al. (1997 ) Na tur e 38-5:^40-8-13-; arid PGT 
International Publication Nos. WO 97/07668 and WO 97/07669). 

In one embodiment, the transgenic animal is a "knock-out" animal having a 
heterozygous or homozygous alteration in the sequence of an endogenous GPC gene that 

10 results in a decrease of GPC function, preferably such that GPC expression is undetectable 
or insignificant. Knock-out animals are typically generated by homologous recombination 
with a vector comprising a transgene having at least a portion of the gene to be knocked 
out. Typically a deletion, addition or substitution has been introduced into the transgene 
to functionally disrupt it. The transgene can be a human gene (e.g., from a human 

15 genomic clone) but more preferably is an ortholog of the human gene derived from the 
transgenic host species. For example, a mouse GPC gene is used to construct a 
homologous recombination vector suitable for altering an endogenous GPC gene in the 
mouse genome. Detailed methodologies for homologous recombination in mice are 
available (see Capecchi, Science (1989) 244:1288-1292; Joyner et al, Nature (1989) 

20 338: 153-156). Procedures for the production of non-rodent transgenic mammals and other 
animals are also available (Houdebine and Chourrout, supra; Pursel et al., Science (1989) 
244:1281-1288; Simms et al, Bio/Technology (1988) 6:179-183). In a preferred 
embodiment, knock-out animals, such as mice harboring a knockout of a specific gene, 
may be used to produce antibodies against the human counterpart of the gene that has been 

25 knocked out (Claesson MH et al., (1994) Scan J Immunol 40:257-264; Declerck PJ et 
al., (1995) J Biol Chem. 270:8397-400). 

In another embodiment, the transgenic animal is a "knock-in" animal having an 
alteration in its genome that results in altered expression (e.g., increased (including 
ectopic) or decreased expression) of the GPC gene, e.g., by introduction of additional 

30 copies of GPC, or by operatively inserting a regulatory sequence that provides for altered 
expression of an endogenous copy of the GPC gene. Such regulatory sequences include 
inducible, tissue-specific, and constitutive promoters and enhancer elements. The knock- 
in can be homozygous or heterozygous. 
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Transgenic nonhuman animals can also be produced that contain selected systems 
allowing for regulated expression of the transgene. One example of such a system that 
may be produced is the cre/loxP recombinase system of bacteriophage PI (Lakso et al., 
PNAS (1992) 89:6232-6236; U.S. Pat. No. 4,959,317). If a cre/loxP recombinase system 
5 is used to regulate expression of the transgene, animals containing transgenes encoding 
both the Gre reeembinase and^-fleleeted^feteiB^fe^equired, Snch animals Gan be 
provided through the construction of "double" transgenic animals, e.g., by mating two 
transgenic animals, one containing a transgene encoding a selected protein and the other 
containing a transgene encoding a recombinase. Another example of a recombinase 

10 system is the FLP recombinase system of Saccharomyces cerevisiae (O'Gorman et al. 
(1991) Science 251:1351-1355; U.S. Pat. No. 5,654,182). In a preferred embodiment, 
both Cre-LoxP and Flp-Frt are used in the same system to regulate expression of the 
transgene, and for sequential deletion of vector sequences in the same cell (Sun X et al 
(2000) Nat Genet 25:83-6). 

15 The genetically modified animals can be used in genetic studies to tether elucidate the 

IRRTK or p21 pathways, as animal models of disease and disorders implicating defective 
IRRTK or p21 function, and for in vivo testing of candidate therapeutic agents, such as 
those identified in screens described below. The candidate therapeutic agents are 
administered to a genetically modified animal having altered GPC function and 

20 phenotypic changes are compared with appropriate control animals such as genetically 
modified animals that receive placebo treatment, and/or animals with unaltered GPC 
expression that receive candidate therapeutic agent. 

In addition to the above-described genetically modified animals having altered GPC 
function, animal models having defective IRRTK or p21 function (and otherwise normal 

25 GPC function), can be used in the methods of the present invention. For example, a 

IRRTK or p21 knockout mouse can be used to assess, in vivo, the activity of a candidate 
IRRTK or p21 modulating agent identified in one of the in vitro assays described below. 
p2 1 knockout mice are described in the literature (Umanoff H, et al, Proc Natl Acad Sci U 
S A 1995 Feb 28;92(5): 1709-13). Preferably, the candidate IRRTK or p21 modulating 

30 agent when administered to a model system with cells defective in IRRTK or p21 

function, produces a detectable phenotypic change in the model system indicating that the 
IRRTK or p21 function is restored, i.e., the cells exhibit normal cell cycle progression. 
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Modulating Agents 

The invention provides methods to identify agents that interact with and/or modulate 
the function of GFC and/or the IRRTK or p21 pathways. Modulating agents identified by 
the methods are also part of the invention. Such agents are useful in a variety of 
5 diagnostic and therapeutic applications associated with the IRRTK or p21 pathways, as 
well artnttofther- analysis of the GPC protein and its contribution toThelRRTK or p21 
pathways. Accordingly, the invention also provides methods for modulating the IRRTK 
or p21 pathways comprising the step of specifically modulating GPC activity by 
administering a GPC-interacting or -modulating agent. 

10 As used herein, an "GPC-modulating agent" is any agent that modulated GPC 

function, for example, an agent that interacts with GPC to inhibit or enhance GPC activity 
or otherwise affect normal GPC function. GPC function can be affected at any level, 
including transcription, protein expression, protein localization, and cellular or extra- 
cellular activity. In a preferred embodiment, the GPC - modulating agent specifically 

15 modulates the function of the GPC. The phrases "specific modulating agent", 

"specifically modulates", etc., are used herein to refer to modulating agents that directly 
bind to the GPC polypeptide or nucleic acid, and preferably inhibit, enhance, or otherwise 
alter, the function of the GPC. These phrases also encompasses modulating agents that 
alter the interaction of the GPC with a binding partner, substrate, or cofactor (e.g. by 

20 binding to a binding partner of a GPC, or to a protein/binding partner complex, and 

altering GPC function). In a further preferred embodiment, the GPC- modulating agent is 
a modulator of the IRRTK or p21 pathways (e.g. it restores and/or upregulates IRRTK or 
p21 function) and thus is also a IRRTK or p21 -modulating agent. 

Preferred GPC-modulating agents include small molecule compounds; GPC- 

25 interacting proteins, including antibodies and other biotherapeutics; and nucleic acid 
modulators such as antisense and RNA inhibitors. The modulating agents may be 
formulated in pharmaceutical compositions, for example, as compositions that may 
comprise other active ingredients, as in combination therapy, and/or suitable carriers or 
excipients. Techniques for formulation and administration of the compounds may be 

30 found in "Remington's Pharmaceutical Sciences" Mack Publishing Co., Easton, PA, 19 th 
edition. 
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SraalJ molecule modulators 

Small molecules, are often preferred to modulate function of proteins with enzymatic 
function, and/or containing protein interaction domains. Chemical agents, referred to in 
5 the art as "small molecule" compounds are typically organic, non-peptide molecules, 
having a more^arweig more preferably 

less than 1,000, and most preferably less than 500. This class of modulators includes 
chemically synthesized molecules, for instance, compounds from combinatorial chemical 
libraries. Synthetic compounds may be rationally designed or identified based on known 

10 or inferred properties of the GPC protein or may be identified by screening compound 

libraries. Alternative appropriate modulators of this class are natural products, particularly 
secondary metabolites from organisms such as plants or fungi, which can also be 
identified by screening compound libraries for GPG-modulating activity. Methods for 
generating and obtaining compounds are well known in the art (Schreiber SL, Science 

15 (2000) 151: 1964-1969; Radmann J and Gunther J, Science (2000) 151:1947-1948). 

Small molecule modulators identified from screening assays, as described below, can 
be used as lead compounds from which candidate clinical compounds may be designed, 
optimized, and synthesized. Such clinical compounds may have utility in treating 
pathologies associated with the 1RRTK or p21 pathways. The activity of candidate small 

20 molecule modulating agents may be improved several-fold through iterative secondary 
functional validation, as furdier described below, structure determination, and candidate 
modulator modification and testing. Additionally, candidate clinical compounds are 
generated with specific regard to clinical and pharmacological properties. For example, the 
reagents may be derivatized and re-screened using in vitro and in vivo assays to optimize 

25 activity and minimize toxicity for pharmaceutical development. 

Protein Modulators 

Specific GPC-interacting proteins are useful in a variety of diagnostic and therapeutic 
applications related to the 1RRTK or p21 pathways and related disorders, as well as in 
30 validation assays for other GPC-modulating agents. In a preferred embodiment, GPC- 
interacting proteins affect normal GPC function, including transcription, protein 
expression, protein localization, and cellular or extra-cellular activity. In another 
embodiment, GPC-interacting proteins are useful in detecting and providing information 
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about the function of GPC proteins, as is relevant to TRRTK or p21 related disorders, such 
as cancer (e.g., for diagnostic means). 

An GPC-interacting protein may be endogenous, i.e. one that naturaily interacts 
genetically or biochemically with a GPC, such as a member of" the GPC pathway that 
5 modulates GPC expression, localization, and/or activity. GPC-modulators include 
dormnant ne^tive 

Yeast two-hybrid and variant screens offer preferred methods for identifying endogenous 
GPC-interacting proteins (Finley, R. L. et al. (1996) in DNA Cloning-Expression Systems: 
A Practical Approach, eds. Glover D. & Hames B. D (Oxford University Press, Oxford, 

10 England), pp. 169-203; Fashcma SF et al., Gene (2000) 250:1-14; Drees BL Curr Opin 
Chem Biol (1999) 3:64-70; Vidal M and Legrain P Nucleic Acids Res (1999) 27:919-29; 
and U.S. Pat. No. 5,928,868). Mass spectrometry is an alternative preferred method for 
the elucidation of protein complexes (reviewed in, e.g., Pandley A and Mann M, Nature 
(2000) 405:837-846; Yates JR 3 rd , Trends Genet (2000) 16:5-8). 

15 An GPC-interacting protein may be an exogenous protein, such as a GPC-specific 
antibody or a T-cell antigen receptor (sec, e.g., Harlow and Lane (1988) Antibodies, A 
Laboratory Manual, Cold Spring Harbor Laboratory; Harlow arid Lane (1999) Using 
antibodies: a laboratory manual. Cold Spring Harbor, NY: Cold Spring Harbor Laboratory 
Press). GPC antibodies are further discussed below. 

20 In preferred embodiments, a GPC-interacting protein specifically binds a GPC protein. 

In alternative preferred embodiments, a GPC -modulating agent binds a GPC substrate, 
binding partner, or c of actor. 

Antibodies 

25 In another embodiment, the protein modulator is a GPC specific antibody agonist or 

antagonist. The antibodies have therapeutic and diagnostic utilities, and can be used in 
screening assays to identify GPC modulators. The antibodies can also be used in 
dissecting the portions of the GPC pathway responsible for various cellular responses and 
in the general processing and maturation of the GPC. 

30 Antibodies that specifically bind GPC polypeptides can be generated using known 

methods. Preferably the antibody is specific to a mammalian orfholog of GPC 
polypeptide, and more 'preferably, to human GPC. Antibodies may be polyclonal, 
monoclonal (mAbs), humanized or chimeric antibodies, single chain antibodies, Fab 
fragments, F(ab').sub.2 fragments, fragments produced by a FAb expression library, anti- 
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idiotypic (anti-Id) antibodies, and epitope-bindiug fragments of any of the above. 
Epitopes of GPC which are particularly antigenic can be selected, for example, by routine 
screening of GPC polypeptides for antigenicity or by applying a theoretical method for 
selecting antigenic regions of a protein (Hopp and Wood (1981), Proc. Nati. Acad. Sci. 
5 U.S.A. 78:3824-28; Hopp and Wood, (1983) Mol. Immunol. 20:483-89; Sutcliffe et al., 
fl983tiSdencer^9^6(^-66)-to- tlie amino acid s e quen c e sho w n in any of S E Q ID N Q s:1 3, 
14, 15, 16, 17, 18, 19, 20, 21, or 22. Monoclonal antibodies with affinities of 10 8 M" 1 
preferably 1 0 9 M 1 to 10 10 M" 1 , or stronger can be made by standard procedures as 
described (Harlow and Lane, supra; Goding (1986) Monoclonal Antibodies: Principles 

10 and Practice (2d ed) Academic Press, New York; and U.S. Pat. Nos. 4,381,292; 4,451,570; 
and 4,618,577). Antibodies may be generated against crude cell extracts of GPC or 
substantially purified fragments thereof. If GPC fragments are used, they preferably 
comprise at least 10, and more preferably, at least 20 contiguous amino acids of a GPC 
protein. In a particular embodiment, GPC-spccific antigens and/or immunogens are 

15 coupled to carrier proteins that stimulate the immune response. For example, the subject 
polypeptides are covalently coupled to the keyhole limpet hemocyanin (KLH) carrier, and 
the conjugate is emulsified in Freund's complete adjuvant, which enhances the immune 
response. An appropriate immune system such as a laboratory rabbit or mouse is 
immunized according to conventional protocols. 

20 The presence of GPC-specific antibodies is assayed by an appropriate assay such as a 
solid phase enzyme-linked immunosorbant assay (ELISA) using immobilized 
corresponding GPC polypeptides. Other assays, such as radioimmunoassays or 
fluorescent assays might also be used. 

Chimeric antibodies specific to GPC polypeptides can be made that contain different 

25 portions from different animal species. For instance, a human immunoglobulin constant 
region may be linked to a variable region of a murine mAb, such that the antibody derives 
its biological activity from the human antibody, and its binding specificity from the 
murine fragment. Chimeric antibodies are produced by splicing together genes that 
encode the appropriate regions from each species (Morrison et al., Proc. Natl. Acad. Sci. 

30 (1984) 81:6851-6855; Neuberger et al., Nature (1984) 312:604-608; Takeda et al., Nature 
(1985) 31:452-454). Humanized antibodies, which are a form of chimeric antibodies, can 
be generated by grafting complementary-determining regions (CDRs) (Carlos, T. M., J. M. 
Harlan. 1994. Blood 84:2068-2101) of mouse antibodies into a background of human 
framework regions and constant regions by recombinant DNA technology (Ricchmann 
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LM, etal., 1988 Nature 323: 323-327). Humanized antibodies contain -10% murine 
sequences and -90% human sequences, and thus further reduce or eliminate 
immunogenicity, while retaining the antibody specificities (Co MS, and Queen C. 1991 
Nature 351: 501-501; Morrison SL. 1992 Ann. Rev. Immun. 10:239-265). Humanized 
5 antibodies and methods of their production are well-known in the art (U.S. Pat. Nos. 
5-,53t);retr 5^585,-089, 5,693,702, a n d 6, 18 0,37 0 ). 

GPC-specific single chain antibodies which are recombinant, single chain polypeptides 
formed by linking the heavy and light chain fragments of the Fv regions via an amino acid 
bridge, can be produced by methods known in the art (U.S. Pat. No. 4,946,778; Bird, 

10 Science (1988) 242:423-426; Huston et al., Proc. Natl. Acad. Sci. USA (1988) 85:5879- 
5883; and Ward et al., Nature (1989) 334:544-546). 

Other suitable techniques for antibody production involve in vitro exposure of 
lymphocytes to the antigenic polypeptides or alternatively to selection of libraries of 
antibodies in phage or similar vectors (Huse et al., Science (1989) 246:1275-1281). As 

15 used herein, T-ccll antigen receptors are included within the scope of antibody modulators 
(Harlow and Lane, 1988, supra). 

The polypeptides and antibodies of the present invention may be used with or without 
modification. Frequently, antibodies will be labeled by joining, either covalently or non- 
covalcntly, a substance that provides for a detectable signal, or that is toxic to cells that 

20 express the targeted protein (Menard S, et al., Int J. Biol Markers (1989) 4:131-134). A 
wide variety of labels and conjugation techniques arc known and are reported extensively 
in both the scientific and patent literature. Suitable labels include radionuclides, enzymes, 
substrates, cofactors, inhibitors, fluorescent moieties, fluorescent emitting lanthanide 
metals, chemiluminescent moieties, biolumincscent moieties, magnetic particles, and the 

25 like (U.S. Pat. Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 4,275,149; 
and 4,366,241). Also, recombinant immunoglobulins may be produced (U.S. Pat. No. 
4,816,567). Antibodies to cytoplasmic polypeptides may be delivered and reach their 
targets by conjugation with membrane-penetrating toxin proteins (U.S. Pat. No. 
6,086,900). 

30 When used therapeutically in a patient, the antibodies of the subject invention are 

typically administered parenterally, when possible at the target site, or intravenously. The 
therapeutically effective dose and dosage regimen is determined by clinical studies. 
Typically, the amount of antibody administered is in the range of about 0. 1 mg/kg -to 
about 10 mg/kg of patient weight. For parenteral administration, the antibodies are 
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formulated in a unit dosage injectable form (e.g., solution, suspension, emulsion) in 
association with a pharmaceutical ly acceptable vehicle. Such vehicles are inherently 
nontoxic and non-therapeutic. Examples are water, saline, Ringer's solution, dextrose 
solution, and 5% human serum albumin. Nonaqueous vehicles such as fixed oils, ethyl 
5 oleate, or liposome carriers may also be used. The vehicle may contain minor amounts of 
arldftrves; irodras and preservatives, which enhance isotomciry and chemical 

stability or otherwise enhance therapeutic potential. The antibodies' concentrations in 
such vehicles are typically in the range of about 1 mg/ml to aboutlO mg/ml. 
Immunotherapeutic methods are further described in the literature (US Pat. No. 5,859,206; 
10 WO0073469). 

Specific biotherapeutics 

In a preferred embodiment, a GPC-interacting protein may have biotherapeutic 
applications. Biotherapeutic agents formulated in pharmaceutically acceptable carriers 
15 and dosages may be used to activate or inhibit signal transduction pathways. This 

modulation may be accomplished by binding a ligand, thus inhibiting the activity of the 
pathway; or by binding a receptor, either to inhibit activation of, or to activate, the 
receptor. Alternatively, the biotherapeutic may itself be a ligand capable of activating or 
inhibiting a receptor. Biotherapeutic agents and methods of producing them are described 
20 in detail in U.S. Pat. No. 6,146,628. 

GPC ligand(s), antibodies to the ligand(s) or the GPC itself may be used as 
biotherapeutics to modulate the activity of GPC in the 1RRTK or p21 pathways. 

Nucleic Acid Modulators 

. 25 Other preferred GPC-modulating agents comprise nucleic acid molecules, such as 

antisense oligomers or double stranded RNA (dsRNA), which generally inhibit GPC 
activity. Preferred nucleic acid modulators interfere with the function of the GPC nucleic 
acid such as DNA replication, transcription, translocation of the GPC RNA to the site of 
protein translation, translation of protein from the GPC RNA, splicing of the GPC RNA to 
30 yield one or more mRNA species, or catalytic activity which may be engaged in or 
facilitated by the GPC RNA. 

In one embodiment, the antisense oligomer is an oligonucleotide that is sufficiently 
complementary to a GPC mRNA to bind to and prevent translation, preferably by binding 
to the 5' untranslated region. GPC-specific antisense oligonucleotides, preferably range 
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from at least 6 to about 200 nucleotides. In some embodiments the oligonucleotide is 
preferably at least 10, 15, or 20 nucleotides in length. In other embodiments, the 
oligonucleotide is preferably less than 50, 40, or 30 nucleotides in length. The 
oligonucleotide can be DNA or RNA or a chimeric mixture or derivatives or modified 
5 versions thereof, single-stranded or double-stranded. The oligonucleotide can be modified 
'm^^'t'^e'mcasty'imssrTiKsietyrox phosphate backbone. The oUgouucleotiderrrray 
include other appending groups such as peptides, agents that facilitate transport across the 
cell membrane, hybridization- triggered cleavage agents, and intercalating agents. 
In another embodiment, the antisense oligomer is a phosphothioate morpholino 
1 0 oligomer (PMO). PMOs are assembled from four different morpholino subunits, each of 
which contain one of four genetic bases (A, C, G, or T) linked to a six-membered 
morpholine ring. Polymers of these subunits are joined by non-ionic phosphodiamidate 
intersubunit linkages. Details of how to make and use PMOs and other antisense 
oligomers are well known in the art (e.g. see W099/18193; Probst JC, Antisense 
15 Oligodeoxynucleotide and Ribozyme Design, Methods. (2000) 22(3):271-281; Summerton 
J, and Weller D. 1997 Antisense Nucleic Acid Drug Dev. :7:187-95; US Pat. No. 
5,235,033; and US Pat No. 5,378,841). 

Alternative preferred GPC nucleic acid modulators are double-stranded RNA species 
mediating RNA interference (RNAi). RNAi is the process of sequence-specific, post- 
20 transcriptional gene silencing in animals and plants, initiated by double-stranded RNA 

(dsRNA) that is homologous in sequence to the silenced gene. Methods relating to the use 
of RNAi to silence genes in C. elegans, Drosophila, plants, and humans are known in the 
art (Fire A, et al., 1998 Nature 391:806-811; Fire, A. Trends Genet. 15, 358-363 (1999); 
Sharp, P. A. RNA interference 2001. Genes Dev. 15, 485-490 (2001); Hammond, S. M., 
25 et al., Nature Rev. Genet. 2, 110-11 19 (2001); Tuschl, T. Chem. Biochem. 2, 239-245 
(2001); Hamilton, A. et al., Science 286, 950-952 (1999); Hammond, S. M., et al., 
Nature 404, 293-296 (2000); Zamore, P. D., et al., Cell 101, 25-33 (2000); Bernstein, E., 
et al, Nature 409, 363-366 (2001); Elbashir, S. M., et al., Genes Dev. 15, 188-200 
(2001); WO0129058; W09932619; Elbashir SM, et al., 2001 Nature 411:494-498). 
30 Nucleic acid modulators are commonly used as research reagents, diagnostics, and 
therapeutics. For example, antisense oligonucleotides, which are able to inhibit gene 
expression with exquisite specificity, are often used to elucidate the function of particular 
genes (see, for example, U.S. Pat. No. 6,165,790). Nucleic acid modulators are also used, 
for example, to distinguish between functions of various members of a biological pathway. 
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For example, antisense oligomers have been employed as therapeutic moieties in the 
treatment of disease states in animals and man and have been demonstrated in numerous 
clinical trials to be safe and effective (Milligan JF, etal, Current Concepts in Antisense 
Drug Design, J Med Chem. (1993) 36:1923-1937; Tonkinson JL et al, Antisense 
5 Oligodeoxynucleotides as Clinical Therapeutic Agents, Cancer Invest. (1996) 14:54-65). 
Accordir4g4y T 4B-OB& aspect-ofAe4 avention, a GPC -Bp ecifio -meleis-aekt rnedalater is nsed- 
in an assay to further elucidate the role of the GPC in the IRRTK or p21 pathways, and/or 
its relationship to other members of the pathway. In another aspect of the invention, a 
GPC-specific antisense oligomer is used as a therapeutic agent for treatment of IRRTK or 
10 p21 -related disease states. 

Assay Systems 

The invention provides assay systems and screening methods for identifying specific 
modulators of GPC activity. As used herein, an "assay system" encompasses all the 

15 components required for performing and analyzing results of an assay that detects and/or 
measures a particular event. In general, primary assays are used to identify or confirm a 
modulator's specific biochemical or molecular effect with respect to the GPC nucleic acid 
or protein. In general, secondary assays further assess the activity of a GPC modulating 
agent identified by a primary assay and may confirm that the modulating agent affects 

20 GPC in a manner relevant to the IRRTK or p21 pathways. In some cases, GPC 
modulators will be directly tested in a secondary assay. 

In a preferred embodiment, the screening method comprises contacting a suitable 
assay system comprising a GPC polypeptide or nucleic acid with a candidate agent under 
conditions whereby, but for the presence of the agent, the system provides a reference 

25 activity (e.g. binding activity), which is based on the particular molecular event the 

screening method detects. A statistically significant difference between the agent-biased 
activity and the reference activity indicates that the candidate agent modulates GPC 
activity, and hence the IRRTK or p21 pathways. The GPC polypeptide or nucleic acid 
used in the assay may comprise any of the nucleic acids or polypeptides described above. 

30 

Primary Assays 

The type of modulator tested generally determines the type of primary assay. 
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Primary assays for small molecule modulators 

For small molecule modulators, screening assays are used to identify candidate 
modulators. Screening assays may be cell-based or may use a cell-free system that 
recreates or retains the relevant biochemical reaction of the target protein (reviewed in 
5 Sittampalam GS et al, Curr Opin Chem Biol (1997) 1:384-91 and accompanying 

references). As used herein the term "eell-based" refers to assays u-sing live cells, dead 
cells, or a particular cellular fraction, such as a membrane, endoplasmic reticulum, or 
mitochondrial fraction. The term "cell free" encompasses assays using substantially 
purified protein (either endogenous or recombinantly produced), partially purified or crude 

10 cellular extracts. Screening assays may detect a variety of molecular events, including 
protein-DNA interactions, protein-protein interactions {e.g., receptor-ligand binding), 
transcriptional activity (e.g., using a reporter gene), enzymatic activity (e.g., via a property 
of the substrate), activity of second messengers, immunogenicty and changes in cellular 
morphology or other cellular characteristics. Appropriate screening assays may use a wide 

15 range of detection methods including fluorescent, radioactive, colorimetric, 

spectrophotometric, and amperometric methods, to provide a read-out for the particular 
molecular event detected. 

Cell-based screening assays usually require systems for recombinant expression of 
GPC and any auxiliary proteins demanded by the particular assay. Appropriate methods 

20 for generating recombinant proteins produce sufficient quantities of proteins that retain 
their relevant biological activities and are of sufficient purity to optimize activity and 
assure assay reproducibility. Yeast two-hybrid and variant screens, and mass spectrometry 
provide preferred methods for determining protein-protein interactions and elucidation of 
protein complexes. In certain applications, when GPC-interacting proteins are used in 

25 screens to identity small molecule modulators, the binding specificity of the interacting 
protein to the GPC protein may be assayed by various known methods such as substrate 
processing (e.g. ability of the candidate GPC-specific binding agents to function as 
negative effectors in GPC-expressing cells), binding equilibrium constants (usually at least 
about 10 7 M~\ preferably at least about 10 s NT 1 , more preferably at least about 10 9 M" 1 ), 

30 and immunogenicity (e.g. ability to elicit GPC specific antibody in a heterologous host 
such as a mouse, rat, goat or rabbit). For enzymes and receptors, binding may be assayed 
by, respectively, substrate and ligand processing. 

lire screening assay may measure a candidate agent's ability to specifically bind to or 
modulate activity of a GPC polypeptide, a fusion protein thereof, or to cells or membranes 
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bearing the polypeptide or fusion protein. The GPC polypeptide can be full length or a 
fragment thereof that retains functional GPC activity. The GPC polypeptide may be fused 
to another polypeptide, such as a peptide tag for detection or anchoring, or to another tag. 
The GPC polypeptide is preferably human GPC, or is an ortholog or derivative thereof as 
5 described above. In a preferred embodiment, the screening assay detects candidate agent- 
ba^ed ffifjdulMioft ctf G^ 

exogenous protein or other substrate that has GPC -specific binding activity, and can be 
used to assess normal GPC gene function. 

Suitable assay formats that may be adapted to screen for GPC modulators arc known 

10 in the art. Preferred screening assays are high throughput or ultra high throughput and 
thus provide automated, cost-effective means of screening compound libraries for lead 
compounds (Fernand.es PB, Curr Opin Chem Biol (1998) 2:597-603; Sundberg SA, Curr 
Opin Biotechnol 2000, 1 1 :47-53). In one preferred embodiment, screening assays uses 
fluorescence technologies, including fluorescence polarization, time-resolved 

15 fluorescence, and fluorescence resonance energy transfer. These systems offer means to 
monitor protein-protein or DNA-protein interactions in which the intensity of the signal 
emitted from dye-labeled molecules depends upon their interactions with partner 
molecules (e.g., Selvin PR, Nat Struct Biol (2000) 7:730-4; Fernandes PB, supra; 
Hertzberg RP and Pope AJ, Curr Opin Chem Biol (2000) 4:445-451). 

20 A variety of suitable assay systems may be used to identify candidate GPC and IRRTK 

or p21 pathways modulators (e.g. U.S. Pat. Nos. 5,550,019 and 6,133,437 (apoptosis 
assays), among others). Specific preferred assays are described in more detail below. 

Apoptosis assays. Assays for apoptosis may be performed by terminal 
25 deoxynucleotidyl transferase-mediated digoxigenin-1 1-dUTP nick end labeling (TUNEL) 
assay. The TUNEL assay is used to measure nuclear DNA fragmentation characteristic of 
apoptosis ( Lazebnik et at, 1994, Nature 371, 346), by following the incorporation of 
fluorescein-dUTP (Yonehara et at, 1989, J. Exp. Med. 169, 1747). Apoptosis may further 
be assayed by acridine orange staining of tissue culture cells (Lucas, R., et al, 1998, Blood 
30 15:4730-41). An apoptosis assay system may comprise a cell that expresses a GPC, and 
that optionally has defective IRRTK orp21 function (e.g. IRRTK or p21 is over-expressed 
or under-expressed relative to wild-type cells). A test agent can be added to the apoptosis 
assay system and changes in induction of apoptosis relative to controls where no test agent 
is added, identify candidate IRRTK or p21 modulating agents. In some embodiments of 
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the invention, an apoptosis assay may be used as a secondary assay to test a candidate 
IRRTK or p21 modulating agents that is initially identified using a cell-free assay system. 
An apoptosis assay may also be used to test whether GPC function plays a direct role in 
apoptosis. For example, an apoptosis assay may be performed on cells that over- or under- 
5 express GPC relative to wild type cells. Differences in apoptotic response compared to 
wM type e«lt 

Apoptosis assays are described further in US Pat. No. 6,133,437. 

Cell proliferation and cell cycle assays. Cell proliferation may be assayed via 
10 bromodeoxyuridine (BRDU) incorporation. This assay identifies a cell population 
undergoing DNA synthesis by incorporation of BRDU into newly-synthesized DNA. 
Newly-synthesized DNA may then be detected using an anti-BRDU antibody (Hoshino et 
al., 1986, Int. J. Cancer 38, 369; Campana et al, 1988, J. Immunol. Meth. 107, 79), or by 
other means. 

15 Cell, Proliferation may also be examined using [ 3 II]-thymidinc incorporation (Chen, J., 

1996, Oncogene 13:1395-403; Jeoung, J., 1995, J. Biol. Chem. 270:18367-73). This 
assay allows for quantitative characterization of S-phase DNA syntheses. In this assay, 
cells synthesizing DNA will incorporate [ 3 H]-thymidine into newly synthesized DNA. 
Incorporation can then be measured by standard techniques such as by counting of 

20 radioisotope in a scintillation counter (e.g., Beckman LS 3800 Liquid Scintillation 
Counter). 

Cell proliferation may also be assayed by colony formation in soft agar (Sambrook et 
al., Molecular Cloning, Cold Spring Harbor (1989)). For example, cells transformed with 
GPC are seeded in soft agar plates, and colonies are measured and counted after two 

25 weeks incubation. 

Involvement of a gene in the cell cycle may be assayed by flow cytometry (Gray JW et 
al. (1 986) Int J Radial Biol Rclat Stud Phys Chem Med 49:237-55). Cells transfected with 
a GPC may be stained with propidium iodide and evaluated in a flow cytometer (available 
from Bccton Dickinson). 

30 Accordingly, a cell proliferation or cell cycle assay system may comprise a cell that 
expresses a GPC, and that optionally has defective IRRTK or p21 function (e.g. IRRTK or 
p21 is over-expressed or under-expressed relative to wild-type cells). A test agent can be 
added to the assay system and changes in cell proliferation or ceil cycle relative to controls 
where no test agent is added, identify candidate IRRTK or p21 modulating agents. In 
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some embodiments of the invention, the cell proliferation or cell cycle assay may be used 
as a secondary assay to test a candidate IRRTK or p21 modulating agents that is initially 
identified using another assay system such as a cell-free assay system. A cell proliferation 
assay may also be used to test whether GPC function plays a direct role in cell 
5 proliferation or cell cycle. For example, a cell proliferation or cell cycle assay may be 
performed €»eeif s ttet over^^ 

Differences in proliferation or cell cycle compared to wild type cells suggests that the 
GPC plays a direct role in cell proliferation or cell cycle. 

10 Angiogenesis. Angiogenesis may be assayed using various human endothelial cell 
systems, such as umbilical vein, coronary artery, or dermal cells. Suitable assays include 
Alamar Blue based assays (available from Biosource International) to measure 
proliferation; migration assays using fluorescent molecules, such as the use of Becton 
Dickinson Falcon HTS FluoroBlock cell culture inserts to measure migration of cells 

15 through membranes in presence or absence of angiogenesis enhancer or suppressors; and 
tubule formation assays based on the formation of tubular structures by endothelial cells 
on Matrigel® (Becton Dickinson). Accordingly, an angiogenesis assay system may 
comprise a cell that expresses a GPC, and that optionally has defective IRRTK or p21 
function (e.g. IRRTK or p21 is over-expressed or under-expressed relative to wild-type 

20 cells). A test agent can be added to the angiogenesis assay system and changes in 

angiogenesis relative to controls where no test agent is added, identify candidate IRRTK 
or p21 modulating agents. In some embodiments of the invention, the angiogenesis assay 
may be used as a secondary assay to test a candidate IRRTK or p21 modulating agents that 
is initially identified using another assay system. An angiogenesis assay may also be used 

25 to test whether GPC function plays a direct role in cell proliferation. For example, an 

angiogenesis assay may be performed on cells that over- or under-express GPC relative to 
wild type cells. Differences in angiogenesis compared to wild type cells suggests that the 
GPC plays a direct role in angiogenesis. 

30 Hypoxic induction. The alpha subunit of the transcription factor, hypoxia inducible 

factor-] (HIF-1), is upregulated in tumor cells following exposure to hypoxia in vitro. 
Under hypoxic conditions, HIF-1 stimulates the expression of genes known to be 
important in tumour cell survival, such as those encoding glyolytic enzymes and VEGF. 
Induction of such genes by hypoxic conditions may be assayed by growing cells 
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transfected with GPC in hypoxic conditions (such as with 0.1% 02, 5% C02, and balance 
N2, generated in a Napco 7001 incubator (Precision Scientific)) and norrnoxic conditions, 
followed by assessment of gene activity or expression by Taqman®. For example, a 
hypoxic induction assay system may comprise a cell that expresses a GPC, and that 
5 optionally has a mutated IRRTK or p21 (e.g. IRRTK or p21 is over-expressed or under- 
expressed relative to wild-4ype-^elfsVA-tes4-ageflt-ean be added to the -hypoxic -induction 
assay system and changes in hypoxic response relative to controls where no test agent is 
added, identify candidate IRRTK or p21 modulating agents. In some embodiments of the 
invention, the hypoxic induction assay may be used as a secondary assay to test a 

10 candidate IRRTK or p21 modulating agents that is initially identified using another assay 
system. A hypoxic induction assay may also be used to test whether GPC function plays a 
direct role in the hypoxic response. For example, a hypoxic induction assay may be 
performed on cells that over- or under-express GPC relative to wild type cells. 
Differences in hypoxic response compared to wild type cells suggests that the GPC plays a 

15 direct role in hypoxic induction. 

Cell adhesion. Cell adhesion assays measure adhesion of cells to purified adhesion 
proteins, or adhesion of cells to each other, in presence or absence of candidate 
modulating agents. Cell-protein adhesion assays measure the ability of" agents to modulate 

20 the adhesion of cells to purified proteins. For example, recombinant proteins are 

produced, diluted to 2.5g/mL in PBS, and used to coat the wells of a microtiter plate. The 
wells used for negative control are not coated. Coated wells are then washed, blocked 
with 1% BSA, and washed again. Compounds are diluted to 2x final test concentration 
and added to the blocked, coated wells. Cells are then added to the wells, and the unbound 

25 cells are washed off. Retained cells are labeled directly on the plate by adding a 

membrane-permeable fluorescent dye, such as calcein-AM, and the signal is quantified in 
a fluorescent microplatc reader. 

Cell-cell adhesion assays measure the ability of agents to modulate binding of cell 
adhesion proteins with their native ligands. These assays use cells that naturally or 

30 rccombinantly express the adhesion protein of choice. In an exemplary assay, cells 
expressing the cell adhesion protein are plated in wells of a multiwell plate. Cells 
expressing the ligand are labeled with a membrane-permeable fluorescent dye, such as 
BCECF , and allowed to adhere to the monolayers in the presence of candidate agents. 
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Unbound cells are washed off, and bound cells are detected using a fluorescence plate 
reader. 

High-throughput cell adhesion assays have also been described. In one such assay, 
small molecule ligands and peptides are bound to the surface of microscope slides using a 
5 microarray spotter, intact cells are then contacted with the slides, and unbound cells are 
washed off . fa this assay nGt-es^y-ths-hh^ding-fipeeifieity ef the pep-tides and modulators 
against cell lines are determined, but also the functional cell signaling of attached cells 
using immunofluorescence techniques in situ on the microchip is measured (Falsey JR et 
al., Bioconjug Chem. 2001 May-Jun;12(3):346-53). 

10 

Primary assays for antibody modulators 

For antibody modulators, appropriate primary assays test is a binding assay that tests 
the antibody's affinity to and specificity for the GPC protein. Methods for testing antibody 
affinity and specificity are well known in the art (Harlow and Lane, 1988, 1999, supra). 
15 The enzyme-linked immunosorbant assay (ELISA) is a preferred method for detecting 
GPC-specific antibodies; others include FACS assays, radioimmunoassays, and 
fluorescent assays. 

Primary assays for nucleic acid modulators 

20 For nucleic acid modulators, primary assays may test the ability of the nucleic acid 
modulator to inhibit or enhance GPC gene expression, preferably mRNA expression. In 
general, expression analysis comprises comparing GPC expression in like populations of 
cells (e.g., two pools of cells that endogenously or recombinantly express GPC) in the 
presence and absence of the nucleic acid modulator. Methods for analyzing mRNA and 

25 protein expression are well known in the art. For instance, Northern blotting, slot blotting, 
ribonuclease protection, quantitative RT-PCR (e.g., using the TaqMan®, PE Applied 
Biosystems), or microarray analysis may be used to confirm that GPC mRNA expression 
is reduced in cells treated with the nucleic acid modulator (e.g., Current Protocols in 
Molecular Biology (1994) Ausubel FM era/., eds., John Wiley & Sons, Inc., chapter 4; 

30 Freeman WM et al, Biotechniques (1999) 26:112-125; Kallioniemi OP, Ann Med 2001, 
33: 142-147; Blohm DH and Guiseppi-Elie, A Curr Opin Biotechnol 2001, 12:41-47). 
Protein expression may also be monitored. Proteins are most commonly detected with 
specific antibodies or antisera directed against either the GPC protein or specific peptides. 
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A variety of means including Western blotting, ELISA, or in situ detection, are available 
(Harlow E and Lane D, 1988 and 1999, supra). 

Secondary Assays 

5 Secondary assays may be used to further assess the activity of GPC-modulating agent 

i dentified by any of the ahoreTnetrrods-tcnOTrf modulating agent affects GPC 

in a manner relevant to the IRRTK or p21 pathways. As used herein, GPC-modulating 
agents encompass candidate clinical compounds or other agents derived from previously 
identified modulating agent. Secondary assays can also be used to test the activity of a 

10 modulating agent on a particular genetic or biochemical pathway or to test the specificity 
of the modulating agent's interaction with GPC. 

Secondary assays generally compare like populations of cells or animals (e.g., two 
pools of cells or animals that endogenously or recombinantly express GPC) in the 
presence and absence of the candidate modulator. In general, such assays test whether 

15 treatment of cells or animals with a candidate GPC-modulating agent results in changes in 
the IRRTK or p21 pathways in comparison to untreated (or mock- or placebo-treated) cells 
or animals. Certain assays use "sensitized genetic backgrounds", which, as used herein, 
describe cells or animals engineered for altered expression of genes in the IRRTK or p21 
or interacting pathways. 

20 

Cell-based assays 

Cell based assays may use cell lines known to have defective IRRTK or p21 function, 
such as HCT1 16 colon cancer cells with defective p21, available from American Type 
Culture Collection (ATCC), Manassas, VA). 
25 Cell based assays may detect endogenous IRRTK or p21 pathway activity or may rely 

on recombinant expression of IRRTK or p21 pathway components. Any of the 
aforementioned assays may be used in this cell-based format. Candidate modulators are 
typically added to the cell media but may also be injected into cells or delivered by any 
other efficacious means. 

30 

Animal Assays 

A variety of non-human animal models of normal or defective IRRTK or p21 
pathways may be used to test candidate GPC modulators. Models for defective IRRTK or 
p21 pathways typically use genetically modified animals that have been engineered to 
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mis-express (e.g., over-express or lack expression in) genes involved in the IRRTK or p21 
pathways. Assays generally require systemic delivery of the candidate modulators, such' 
as by oral administration, injection, etc. 

In a preferred embodiment, IRRTK or p21 pathways activity is assessed by monitoring 
5 neovascularization and angiogenesis. Animal models with defective and normal IRRTK 
or p21 -are-trsedr to test the candrdatermoiMatorVafifect-on GPC in Matrigei® assays. 
Matrigel© is an extract of basement membrane proteins, and is composed primarily of 
laminin, collagen IV, and heparin sulfate proteoglycan. It is provided as a sterile liquid at 
4° C, but rapidly forms a solid gel at 37° C. Liquid Matrigel® is mixed with various 

10 angiogenic agents, such as bFGF and VEGF, or with human tumor cells which over- 
express the GPC. The mixture is then injected subcutaneously(SC) into female athymic 
nude mice (Taconic, Germantown, NY) to support an intense vascular response. Mice 
.with Matrigel© pellets may be dosed via oral (PO), intraperitoneal (IP), or intravenous 
(IV) routes with the candidate modulator. Mice are euthanized 5 - 12 days post-injection, 

15 and the Matrigel® pellet is harvested for hemoglobin analysis (Sigma plasma hemoglobin 
kit). Hemoglobin content of the gel is found to correlate the degree of neovascularization 
in the gel. 

In another preferred embodiment, the effect of the candidate modulator on GPC is 
assessed via tumorigenicity assays. In one example, xenograft human tumors are 

20 implanted SC into female athymic mice, 6-7 week old, as single cell suspensions either 
from a pre-existing tumor or from in vitro culture. The tumors which express the GPC 
endogenously are injected in the flank, 1 x 10 5 to 1 x 10 7 cells per mouse in a volume of 
100 /iL using a 27gauge needle. Mice are then ear tagged and tumors are measured twice 
weekly. Candidate modulator treatment is initiated on the day the mean tumor weight 

25 reaches 100 mg. Candidate modulator is delivered lV, SC, IP, or PO by bolus 
administration. Depending upon the pharmacokinetics of each unique candidate 
modulator, dosing can be performed multiple times per day. The tumor weight is assessed 
by measuring perpendicular diameters with a caliper and calculated by multiplying the 
measurements of diameters in two dimensions. At the end of the experiment, the excised 

30 tumors maybe utilized for biomarker identification or further analyses. For 

immunohistochemistry staining, xenograft tumors are fixed in 4% paraformaldehyde, 
0.1M phosphate, pH 7.2, for 6 hours at4°C, immersed in 30% sucrose in PBS, and rapidly 
frozen in isopentane cooled with liquid nitrogen. 
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Diagnostic and therapeutic uses 

Specific GPC-modulating agents are useful in a variety of diagnostic and therapeutic 
applications where disease or disease prognosis is related to defects in the IRRTK or p21 
pathways, such as angiogenic, apoptotic, or cell proliferation disorders. Accordingly, the 
5 invention also provides methods for modulating the IRRTK or p21 pathways in a cell, 

prefe^y a cell pre-4ete^ 

(e.g. due to overexpression, underexpression, or misexpression of IRRTK or p21, or due to 
gene mutations), comprising the step of administering an agent to the cell that specifically 
modulates GPC activity. Preferably, the modulating agent produces a detectable 

10 phenotypic change in the cell indicating that the IRRTK or p21 function is restored. The 
phrase "function is restored", and equivalents, as used herein, means that the desired 
phenotype is achieved, or is brought closer to normal compared to untreated cells. For 
example, with restored IRRTK or p21 function, cell proliferation and/or progression 
through cell cycle may normalize, or be brought closer to normal relative to untreated 

15 cells. The invention also provides methods for treating disorders or disease associated 
with impaired IRRTK or p21 function by administering a therapeutically effective amount 
of a GPC -modulating agent that modulates the IRRTK or p21 pathways. The invention 
further provides methods for modulating GPC function in a cell, preferably a cell pre- 
determined to have defective or impaired GPC function, by administering a GPC - 

20 modulating agent. Additionally, the invention provides a method for treating disorders or 
disease associated with impaired GPC function by administering a therapeutically 
effective amount of a GPC -modulating agent. 

The discovery that GPC is implicated in IRRTK or p21 pathways provides for a 
variety of methods that can be employed for the diagnostic and prognostic evaluation of 

25 diseases and disorders involving defects in the IRRTK or p21 pathways and for the 
identification of subjects having a predisposition to such diseases and disorders. 

Various expression analysis methods can be used to diagnose whether GPC expression 
occurs in a particular sample, including Northern blotting, slot blotting, ribonuclease 
protection, quantitative RT-PCR, and microarray analysis, (e.g., Current Protocols in 

30 Molecular Biology (1994) Ausubel FM et ah, eds., John Wiley & Sons, Inc., chapter 4; 
Freeman WM et al, Biotechniques (1999) 26:112-125; Kallioniemi OP, Ann Med 2001, 
33:142-147; Blohm and Guiseppi-Elie, Curr Opin Biotechnol 2001, 12:41-47). Tissues 
having a disease or disorder implicating defective IRRTK or p21 signaling that express a 
GPC, are identified as amenable to treatment with a GPC modulating agent. In a preferred 
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application, the IRRTK or p21 defective tissue overexpresses a GPC relative to normal 
tissue. For example, a Northern blot analysis of mJRNA from tumor and normal cell lines, 
or from tumor and matching normal tissue samples from the same patient, using full or 
partial GPC cDNA sequences as probes, can determine whether particular tumors express 
5 or overexpress GPC. Alternatively, the TaqMan® is used for quantitative RT-PCR 
analysis ofljPCexpresScrn in cell lines, normal tissues and tumor samples (PE Applied 
Biosystems). 

Various other diagnostic methods may be performed, for example, utilizing reagents 
such as the GPC oligonucleotides, and antibodies directed against a GPC, as described 

1 0 above for: (1) the detection of the presence of GPC gene mutations, or the detection of 
either over- or under-expression of GPC mRNA relative to the non-disorder state; (2) the 
detection of either an over- or an under-abundance of GPC gene product relative to the 
non-disorder state; and (3) the detection of perturbations or abnormalities in the signal 
transduction pathway mediated by GPC. 

15 Thus, in a specific embodiment, the invention is drawn to a method for diagnosing a 
disease or disorder in a patient that is associated with alterations in GPC expression, the 
method comprising: a) obtaining a biological sample from the patient; b) contacting the 
sample with a probe for GPC expression; c) comparing results from step (b) with a 
control; and d) determining whether step (c) indicates a likelihood of the disease or 

20 disorder. Preferably, the disease is cancer, most preferably a cancer as shown in TABLE 
1. The probe may be either DNA or protein, including an antibody. 

EXAMPLES 

The following experimental section and examples are offered by way of illustration 
25 and not by way of limitation. 

I. Drosophila screens 
An EP overexpression screen was carried out in Drosophila to identify genes that 
modify the small eye phenotype resulting from expression of a dominant-negative InR in 
30 the eye. The EP collection contains a large set of Drosophila lines bearing P insertions 
that overexpress the gene in which they are inserted. Each EP line was crossed to lines 
expressing the dominant negative InR in the eye. Resulting progeny were examined for a 
change in the small eye phenotype. Sequence information surrounding the P insertion site 
was used to identify the overexpressed genes. 
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A dominant loss of function screen was carried out in Drosophila to identify genes that 
interact with the cyclin dependent kinase inhibitor, p21 (Bourne HR, et al., Nature (1990) 
348(6297): 125-132; Marshall CJ, Trends Genet (1991) 7(3):91-95). Expression of thep21 
gene from GMR-p21 transgene (Hay, B. A., et al. (1994) Developmentl20:2121-2129) in 
5 the eye causes deterioration of normal eye morphology, resulting in reduced, rough eyes. 
FMes-eaayiag-t b i s tra n s gene were maintained as a stock (P 1025-fi-genotype: y w; P{p21~ 
pExp-gl-w[+]Hsp70(3'UTR)-5}). Females of this stock were crossed to a collection of 
males carrying piggyBac insertions (FraserM era/., Virology (1985) 145:356-361). 
Resulting progeny carrying both the transgene and transposons were scored for the effect 
10 of the transposon on the eye phenolypc, i.e. whether the transposon enhanced or 

suppressed (or had no effect) the eye phenotype. All data was recorded and all modifiers 
were retested with a repeat of the original cross, and the retests were scored at least twice. 
Modifiers of the eye phenotype were identified as members of the p21 pathway. 

The Drosophila Dally gene (Genbank Identifier number 3023638), was identified as 
15 an enhancer of the small eye phenotype in both the IRRTK and p21 screens, and hence a 
member of the IRRTK and p21pathways. Human orthologs of the modifiers are referred 
to herein as GPC. 

BLAST analysis (Altschul et al., supra) was employed to identify Targets from 
Drosophila modifiers. For example, representative sequences from GPC, GI#s 4504081, 

20 4758462, 4504083, 4758464, and 5031719 (SEQ ID NOs:13, 17, 19, 20, and 22, 

respectively), share 24%, 24%, 22%, 22%, 28, and 23% amino acid identity, respectively, 
with the Drosophila Dally. 

Various domains, signals, and functional subunits in proteins were analyzed using the 
PSORT (Nakai K., and Horton P., Trends Biochem Sci, 1999, 24:34-6; Kenta Nakai, 

25 Protein sorting signals and prediction of subcellular localization, Adv. Protein Chem. 54, 
277-344 (2000)), PFAM (Bateman A., et al., Nucleic Acids Res, 1999, 27:260-2), 
SMART (Ponting CP, et al., SMART: identification and annotation of domains from 
signaling and extracellular protein sequences. Nucleic Acids Res. 1999 Jan 1;27(1):229- 
32), TM-HMM (Erik L.L. Sonnhammer, Gunnar von Heijne, and Anders Krogh: A hidden 

30 Markov model for predicting transmembrane helices in protein sequences. In Proc. of 
Sixth Int. Conf. on Intelligent Systems for Molecular Biology, p 175-182 Ed J. Glasgow, 
T. Littlejohn, F. Major, R. Lathrop, D. Sankoff, and C. Sensen Menlo Park, CA: AAAI 
Press, 1 998), and clust (Remm M, and Sonnhammer E. Classification of transmembrane 
protein families in the Caenorhabditis elegans genome and identification of human 
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orthologs. Genome Res. 2000 Nov;10(ll):1679-89) programs. For example, the glypiean 
domains (PFAM 01153) of GPC from GI#s4504081, 4758462, 4504083, 4758464, and 
5031719 (SEQ ID NOs:13, 17, 19, 20, and 22, respectively) are located at approximately 
amino acid residues 2 to 557, 4 to 578, 1 to 555, 2 to 572, and 7 to 554, respectively 

5 

fl. High^Thron ghput In Vitro-BmdrmrAssay: ; — 

33 P-labelcd GPC peptide is added in an assay buffer (100 mM KC1, 20 mM HEPES pH 
7.6, 1 mM MgCl 2 , 1% glycerol, 0.5% NP-40, 50 mM beta-mercaptoethanol, 1 mg/ml 
BSA, cocktail of protease inhibitors) along with a test agent to the wells of a Neutralite- 
10 avidin coated assay plate and incubated at 25°C for 1 hour. Biotinylated substrate is then 
added to each well and incubated for 1 hour. Reactions are stopped by washing with PBS, 
and counted in a scintillation counter. Test agents that cause a difference in activity 
relative to control without test agent are identified as candidate IRRTK or p21 modulating 
agents. 

15 

III. Imm uno precipitations and lmmunoblottin» 

For coprecipitation of transfected proteins, 3 x 10 6 appropriate recombinant cells 
containing the GPC proteins are plated on 10-cm dishes and transfected on the following 
day with expression constructs. The total amount of DNA is kept constant in each 

20 transfection by adding empty vector. After 24 h, cells are collected, washed once with 

phosphate4ouffered saline and lysed for 20 min on ice in 1 ml of lysis buffer containing 50 
mM Hepes, pH 7.9, 250 mM NaCl, 20 mM -glycerophosphate, 1 mM sodium 
orthovanadate, 5 mM p-nitrophenyl phosphate, 2 mM dithiothreitol, protease inhibitors 
(complete, Roche Molecular Biochemicals), and 1% Nonidet P-40. Cellular debris is 

25 removed by ccntrifugation twice at 15,000 x g for 15 min. The cell lysate is incubated 
with 25 /xl of M2 beads (Sigma) for 2 h at 4 °C with gentle rocking. 

After extensive washing with lysis buffer, proteins bound to the beads are 
solubilized by boiling in SDS sample buffer, fractionated by SDS-polyacrylamide gel 
electrophoresis, transferred to poly vinylidene difluoride membrane and blotted with the 

30 indicated antibodies. The reactive bands are visualized with horseradish peroxidase 
coupled to the appropriate secondary antibodies and the enhanced chemiluminescence 
(ECL) Western blotting detection system (Amersham Pharmacia Biotech). 
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IV. Expression analysis 

All cell lines used in the following experiments are NCI (National Cancer Institute) 
lines, and are available from ATCC (American Type Culture Collection, Manassas, VA 
201 1 0-2209). Normal and tumor tissues were obtained from Impath, UC Davis, Clontech, 
5 Stratagene, and Ambion. 

TaqMan analysis wa^ ii^d"to-a^sess-cxpressiorrteveIs of the disci osed genes in various 
samples. 

RNA was extracted from each tissue sample using Qiagen (Valencia, CA) RNeasy 
kits, following manufacturer's protocols, to a final concentration of 50ng/ul. Single 

1 0 stranded cDNA was then synthesized by reverse transcribing the RNA samples using 
random hexamers and 500ng of total UNA per reaction, following protocol 4304965 of 
Applied Biosystems (Foster City, CA). 

Primers for expression analysis using TaqMan assay (Applied Biosystems, Foster City, 
CA) were prepared according to the TaqMan protocols, and the following criteria: a) 

15 primer pairs were designed to span nitrons to eliminate genomic contamination, and b) 
each primer pair produced only one product. 

Taqman reactions were carried out following manufacturer's protocols, in 25 ul total 
volume for 96-well plates and 10 ul total volume for 384-well plates, using 300nM primer 
and 250 nM probe, and approximately 25ng of cDNA. The standard curve for result 

20 analysis was prepared using a universal pool of human cDNA samples, which is a mixture 
of cDNAs from a wide variety of tissues so that the chance that a target will be present in 
appreciable amounts is good. The raw data were normalized using 18S rRNA (universally 
expressed in all tissues and cells). 

For each expression analysis, tumor tissue samples were compared with matched 

25 normal tissues from the same patient. A gene was considered overexpressed in a tumor 
when the level of expression of the gene was 2 fold or higher in the tumor compared with 
its matched normal sample. In cases where normal tissue was not available, a universal 
pool of cDNA samples was used instead. In these cases, a gene was considered 
overexpressed in a tumor sample when the difference of expression levels between a 

30 tumor sample and the average of all normal samples from the same tissue type was greater 
than 2 times the standard deviation of all normal samples (i.e., Tumor - average(all normal 
samples) > 2 x STDEV(all normal samples) ). 

Results are shown in Table 1. Data presented in bold indicate that greater than 50% of 
tested tumor samples of the tissue type indicated in row 1 exhibited over expression of the 
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gene listed in column 1, relative to normal samples. Underlined data indicates that 
between 25% to 49% of tested tumor samples exhibited over expression. A modulator 
identified by an assay described herein can be further validated for therapeutic effect by 
administration to a tumor in which the gene is overexpressed. A decrease in tumor growth 
5 confirms therapeutic utility of the modulator. Prior to treating a patient with the 

modulator, lite likelihood that-the-f*a^iea^will--r«speHd- to tre-atH»nt can be diagnosed by 
obtaining a tumor sample from the patient, and assaying for expression of the gene 
targeted by the modulator. The expression data for the gene(s) can also be used as a 
diagnostic marker for disease progression. The assay can be performed by expression 
10 analysis as described above, by antibody directed to the gene target, or by any other 
available detection method. 



Table 1 





breast 






colon 






kidney 






lung 






ovary 




GI#4504080 
(SEQ1DN0:1) 


0 


3 




9 


26 




4 


19 




6 


14 




0 


4 


GI#5665747 
(SEQ ID NO:23) 


0 


3 




6 


26 




8 


19 




3 


14 




2 


4 


GM4763902 
(SEQ ID NO:24) 


0 


3 




8 


26 




2 


19 




1 


14 




0 


4 


GI#1 3632295 
(SEQ ID NO:5) 


1 


3 




8 


26 




4 


19 




1 


14 




1 


4 


GI#5360214 
(SEQ ID NO:6) 


1 


3 




0 


26 




1 


19 




0 


14 




3 


4 


GI#4877642 
(SEQ ID NO:8) 


1 


3 




9 


26 




6 


19 




3 


14 




0 


4 
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WHAT IS CLAIMED IS: 

1 . A method of identifying a candidate IRRTK or p21 pathways modulating agent, 
said method comprising the steps of: 

5 (a) providing an assay system comprising a purified GPC polypeptide or nucleic 

-agM-of-a- fu n cti onall y a ctiv e fragment or derivative thereof; — 

(b) contacting the assay system with a test agent under conditions whereby, but for 
the presence of the test agent, the system provides a reference activity; and 

(c) detecting a test agent-biased activity of the assay system, wherein a difference 
10 between the test agent-biased activity and the reference activity identifies the test agent as 

a candidate IRRTK or p21 pathways modulating agent. 

2. The method of Claim 1 wherein the assay system comprises cultured cells that 
express the GPC polypeptide. 

15 

3. The method of Claim 2 wherein the cultured cells additionally have defective 
IRRTK or p21 function. 

4. The method of Claim 1 wherein the assay system includes a screening assay 
20 comprising a GPC polypeptide, and the candidate test agent is a small molecule 

modulator. 

5. The method of Claim 4 wherein the assay is a binding assay. 

25 6. The method of Claim 1 wherein the assay system is selected from the group 

consisting of an apoptosis assay system, a cell proliferation assay system, an angiogenesis 
assay system, and a hypoxic induction assay system. 

7. The method of Claim 1 wherein the assay system includes a binding assay 
30 comprising a GPC polypeptide and the candidate test agent is an antibody. 

8. The method of Claim 1 wherein the assay system includes an expression assay 
comprising a GPC nucleic acid and the candidate test agent is a nucleic acid modulator. 
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9. The method of claim 8 wherein the nucleic acid modulator is an antisense 
oligomer. 



10. The method of Claim 8 wherein the nucleic acid modulator is a PMO. 

5 

11 TJie..mem0d.M..C^ 

(d) administering the candidate IRRTK or p21 pathways modulating agent 
identified in (c) to a model system comprising cells defective in IRRTK or p21 function 
and, detecting a phenotypic change in the model system that indicates that the IRRTK or 

10 p21 function is restored. 

12. The method of Claim 1 1 wherein the model system is a mouse model with 
defective IRRTK or p21 function. 

15 13. A method for modulating a IRRTK or p21 pathways of a cell comprising 

contacting a cell defective in IRRTK or p21 function with a candidate modulator that 
specifically binds to a GPC polypeptide comprising an amino acid sequence selected from 
group consisting of SEQ ID NOs:13, 14, 15, 16, 17, 18, 19, 20, 21, and 22, whereby 
IRRTK or p21 function is restored. 

20 

14. The method of claim 13 wherein the candidate modulator is administered to a 
vertebrate animal predetermined to have a disease or disorder resulting from a defect in 
IRRTK or p21 function. 

25 15. The method of Claim 13 wherein the candidate modulator is selected from the 

group consisting of an antibody and a small molecule. 

16. The method of Claim 1, comprising the additional steps of: 

(d) providing a secondary assay system comprising cultured cells or a non-human 
30 animal expressing GPC , 

(e) contacting the secondary assay system with the test agent of (b) or an agent 
derived therefrom under conditions whereby, but for the presence of the test agent or agent 
derived therefrom, the system provides a reference activity; and 

(f) detecting an agent-biased activity of the second assay system, 
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wherein a difference between the agent-biased activity and the reference activity of the 
second assay system confirms the test agent or agent derived therefrom as a candidate 
IRRTK or p21 pathways modulating agent, 

and wherein the second assay detects an agent-biased change in the IRRTK or p21 
5 pathways. 

17. The method of Claim 16 wherein the secondary assay system comprises cultured 
cells. 

10 18. The method of Claim 16 wherein the secondary assay system comprises a non- 

human animal. 

19. The method of Claim 18 wherein the non-human animal mis-expresses a IRRTK 
or p21 pathways gene. 

15 

20. A method of modulating IRRTK or p21 pathways in a mammalian cell comprising 
contacting the cell with an agent that specifically binds a GPC polypeptide or nucleic acid. 

21. The method of Claim 20 wherein the agent is administered to a mammalian 

20 animal predetermined to have a pathology associated with the IRRTK or p21 pathways. 

22. The method of Claim 20 wherein the agent is a small molecule modulator, a 
nucleic acid modulator, or an antibody. . 

25 23. A method for diagnosing a disease in a patient comprising: 

(a) obtaining a biological sample from the patient; 

(b) contacting the sample with a probe for GPC expression; 

(c) comparing results from step (b) with a control; 

(d) determining whether step (c) indicates a likelihood of disease. 

30 

24. The method of claim 23 wherein said disease is cancer. 

25. The method according to claim 24, wherein said cancer is a cancer as shown in 
Table t as having >25% expression level, 
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SEQUENCE LISTING 

<110> EXKLIXIS, INC. 

<120> GPCs AS MODIFIERS OF THE IRRTK AND p21 PATHWAYS AND METHODS OF 

USE 

<130> EX02-081C-PC 

^-i-S'O-i* - — &Q-/~3 r Q&y~Q-3?&- — ~~ 

<151> 2001-07-12 

<150> US 60/328,507 
<151> 2001-10-10 

<160> 24 

<170> Patent In version 3.1 

<210> 1 

<211> 3692 

<212> DNA 

<213> Homo sapiens 



<400> 1 



ggctgcccga 


qcgagcgttc 


ggacctcgca 


ccccgcgcgc 


cccgcgccgc 


cgccgccgcc 


60 


ggcttttgtt 


gtctccgcct 


cctcggccgc 


cgccgcctct 


ggaccgegag 


ccgcgcgcgc 


120 


cgggaccttg 


gctctgccct 


tcgcgggcgg 


gaactgcgca 


ggacccggcc 


aggatccgag 


1 O A 


agaggcgcgg 


gcgggtggcc 


gggggcgccg 


ccggccccgc 


catggagctc 


cgggcccgag 


240 


gctggtggct 


gctatgtgcg 


gccgcagcgc 


tggtcgcctg 


cgcccgcggg 


gacccggcca 


300 


gcaagagccg 


gagctgcggc 


gaggtccgcc 


agate tacgg 


agecaaggge 


ttcagcctga 


360 


gcgacgtgcc 


ccaggcggag 


atctcgggtg 


agcacctgcg 


gatctgtccc 


cagggctaca 


420 


cctgctgcac 


cagcgagatg 


gaggagaacc 


tggccaaccg 


cagccatgcc 


gagctggaga 


480 


ccgcgctccg 


ggacagcagc 


cgcgtcctgc 


aggecatget 


tgccacccag 


ctgcgcagct 


540 


tcgatgacca 


cttccagcac 


ctgctgaacg 


acteggageg 


gaegctgeag 


gccaccttcc 


600 


ccggcgcctt 


cggagagctg 


tacacgcaga 


aogegaggge 


cttccgggac 


ctgtactcag 


660 


agctgcgcct 


gtactaccgc 


ggtgccaacc 


tgcacctgga 


ggagacgctg 


gecgagttet 


720 


gggcccgcct 


gctcgagcgc 


ctcttcaagc 


agctgcaccc 


ccagctgctg 


ctgcctgatg 


780 


actacctgga 


ctgcctgggc 


aagcaggccg 


aggegctgeg 


gcccttcggg 


gaggccccga 


840 


gagagctgcg 


cctgcgggcc 


acccgtgcct 


tcgtggctgc 


tcgctccttt 


gtgeagggee 


900 


tgggcgtggc 


cagcgacgtg 


gtccggaaag 


tggctcaggt 


ccccctgggc 


ccggagtgct 


960 


cgagagctgt 


catgaagctg 


gtctactgtg 


ctcactgcct 


gggagtcccc 


ggcgccaggc 


1020 


cctgccctga 


ctattgccga 


aatgtgctca 


agggctgect 


tgccaaccag 


gccgacctgg 


1080 


acgccgagtg 


gaggaacctc 


ctggactcca 


tggtgctcat 


caccgacaag 


ttctggggta 


1140 
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catcgggtgt 


ggagagtgtc 


atcggcagcg 


tgcacacgtg 


gctggcggag 


gcc atcaacg 


1 o no 


ccctccagga 


caacagggac 


acgctcacgg 


ccaaggtcat 


cc agggctgc 


gggaacccca 


1260 


aggtcaaccc 


ccagggccct 


gggcctgagg 


agaagcggcg 


ccggggcaag 


ctggccccgc 


1320 


gggagaggcc 


accttcaggc 


acgctggaga agctggtctc 


tgaagccaag 


gcccagctcc 


1380 


gcgaagtcaa... 




atcagcctcc 


cagggacact 


-atgcagtgag. 


aagatggccc 


1440 


tgagcactgc 


cagtgatgac 


cgctgctgga 


acgggatggc 


cagaggccgg 


tacctccccg 


1500 


aggtcatggg 


tgacggcctg 


gccaaccaga 


tcaacaaccc 


cgaggtggag 


gtggacatca 


1560 


ccaagccgga 


catgaccatc 


cggcagcaga 


tcatgcagct 


gaagatcatg 


accaaccggc 


1620 


tgcgcagcgc 


ctacaacggc 


aacgacgtgg 


acttccagga 


cgccagtgac 


gacggcagcg 


1680 


gctcgggcag 


cggtgatggc 


tgtctggatg 


acctctgcgg 


ccggaaggtc 


agcaggaaga 


1740 


gctccagctc 


ccggacgccc 


ttgacccatg 


ccctcccagg 


cctgtcagag 


caggaaggac 


1800 


agaagacctc 


ggctgccagc 


tgcccccagc 


ccccgacctt 


cctcctgccc 


ctcctcotct 


1860 


tcctggccct 


tacagtagcc 


aggccccggt 


ggcggtaact 


gccccaaggc 


occagggaca 


1920 


gaggccaagg 


actgactttg 


ccaaaaatac 


aacacagacg 


atatttaatt 


cacctcagcc 


1980 


tggagaggcc 


tggggtggga 


cagggagggc 


cggcggctct 


gagcaggggc 


aggcgcagag 


2040 


gtcccagccc 


caggcctggc 


ctcgcctgcc 


tttctgcctt 


ttaattttgt 


atgaggtcct 


2100 


caggtcagct 


gggagccagt 


gtgcccaaaa 


gccatgtatt: 


tcagggacct 


caggggcacc 


2160 


tccggctgcc 


tagccctccc 


cccagctccc 


tgcaccgccg 


cagaagcagc 


ccctcgaggc 


2220 


ctacagagga 


ggcctcaaag 


caacccgctg 


gagcccacag 


cgagcctgtg 


ccttcctccc 


2280 


cgcctcctcc 


cactgggact 


cccagcagag 


cccaccagcc 


agccctggcc 


caccccccag 


2340 


catccagaga 


agccccgcac 


gggctgtctg 


ggtgtccgcc 


atccagggtc 


tggcagagcc 


2400 


tctgagatga 


tgcatgatgc 


cctcccctca 


gcgcaggctg 


cagagcccgg 


ccccacctcc 


2460 


ctgcgccctt 


gaggggcccc 


agcgtctgca 


gggtgacgcc 


tgagacagca 


ccactgctga 


2520 


ggagtctgag 


gactgtcctc 


ccacagaccc 


tgcagtgagg 


ggccctccat 


gcgcagatga 


2580 


ggggccactg 


acccacctgc 


gcttctgctg 


gaggagggga 


agctgggccc 


aaaggcccag 


2640 


ggaggcagcg 


tgggctctgc 


caatgtgggc 


tgcccctcgc 


acacagggct 


cacagggcag 


2700 






tgttggagga 


ccccgagggc 


tcracroaQcag 


ccaggacccg 


2760 


cctgctccca 


tcctcaccca 


gatcaggaac 


cagggcctcc 


ctgttcacgg 


tgacacaggt 


2820 


cagggctcag 


agtgaccctc 


ggctgtcacc 


tgctcacagg 


gatgctggtg 


gctggtgaga 


2880 


ccccgcactg 


cacacgggaa 


tgcctaggtc 


ccttcccgac 


ccagccagct 


gcactgcagg 


2940 


gcacggggac 


ctggatagtt 


aagggctttt 


ccaaacatgc 


atccatttac 


tgacacttcc 


3000 
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tgtccttgtt catggagagc tgttcgctcc tcccagatgg cttcggaggc ccgcagggcc 3060 
caccttggac cctggtgacc tcctgtcact cactgaggcc atcagggccc tgccccaggc 3120 
ctggacgggc cctccttccc tcctgtgccc cagctgccag gtggccctgg ggaggggtgg 3180 
tgtggtgttg ggaaggggtc ctgcaggggg aggaggactt ggagggtctg ggggcagctg 3240 

- tcctrgaaxrcg-act g a c c e t g agga g gecge fctagtgetgc tttgctt fefee- - ateea-eegfeee 3 3 0 0 



cgcacagtgg acggaggtcc ccggttgctg gtcaggtccc catggcttgt tctctggaac 3 3 60 

ctgactttag atgttttggg atcaggagcc cccaacacag gcaagtccac cccataataa 3420 

ccctgccagt gccagggtgg gctggggact ctggcacagt gatgccgggc gccaggacag 3480 

cagcactccc gctgcacaca gacggcctag gggtggcgct cagaccccac cctacgctca 3 540 

tctctggaag gggcagccct gagtggtcac tggtcagggc agtggccaag cctgctgtgt 3 600 

ccttcctcca caaggtcccc ccaccgctca gtgtcagcgg gtgacgtgtg ttcttttgag 3660 

tccttgtatg aataaaaggc tggaaaccta aa 3 692 

<210> 2 

<211> 495 

<212> DNA 

<213> Homo sapiens 

<400> 2 

ctctgtccag gtgagcacct; ccgggtctgt ccccaggagt acacctgctg ttccagtgag 60 

acagagcaga ggctgatcag ggagactgag gccaccttcc gaggcctggt ggaggacagc 120 

ggctcctttc tggttcacac actggctgcc aggcacagaa aatttgatga gttttttctg 180 

gagatgctct cagtagccca gcactctctg acccagctct tctcccactc ctacggccgc 240 

ctgtatgccc agcacgccct catattcaat ggcctgttct ctcggctgcg agacttctat 300 

ggggaatctg gtgaggggtt ggatgacacc ctggcggatt tctgggcaca gctcctggag 360 

agagtgttcc cgctgctgca cccacagtac agcttccccc ctgactacct gctctgcctc 420 

tcacgcttgg cctcatctac cgatggctct ctgcagccct ttggggactc accccgccgc 480 

ctccgcctgc aggtg 495 

<210> 3 

<211> 1283 

<212> DNA 

<213> Homo sapiens 

<400> 3 

tcggcagatg ccgcctggtc cagctatcgt gctcggtatt cagttttccg gagcagcgct 60 

ctttctctgg cccgcggagc ggtcccgcgg ccgagtaccg gattcccgag tttgggaggc 12 0 

tctgctttcc tccttaggac ccactttgcc gtcctggggt ggctgcagtt atgtccgcgc 180 
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tgcgacctct cctgcttctg ctgctgcctc tgtgtcccgg tcctggtccc ggacccggga 240 
gcgaggcaaa ggtcacccgg agttgtgcag agacccggca ggtgctgggg gcccggggat 300 
atagcttaaa cctaatccct cccgccctga tctcaggtga gcacctccgg gtctgtcccc 360 
aggagtacac ctgctgttcc agtgagacag agcagaggct gatcagggag actgaggcca 42 0 

. os-fefeG-Ggagg- - ■oe fc g gfcg ga g ■ g a c a gcggct cctrttrfrggf tcnr . aaacta . g ctgcc a ga c 4 8.0 , 



acagaaaatt tgatgataac ccggaccctg gtggctgccc gagcctttgt gcagggcctg 540 

gagactggaa gaaatgtggt cagcgaagcg cttaaggtgc cggtgtctga aggctgcagc 600 

caggctctga tgcgtctcat cggctgtccc ctgtgccggg gggtcccctc acttatgccc 660 

tgccagggct tctgcctcaa cgtggttcgt ggctgtctca gcagcagggg actggagcct 720 

gactggggca actatctgga tggtctcctg atcctggctg ataagctcca gggccccttt 780 

tcctttgagc tgacggccga gtccattggg gtgaagatct cggagggtitt gatgtacctg 840 

caggaaaaca gtgcgaaggt gtccgcccag gtgtttcagg agtgcggccc ccccgacccg 900 

gtgcctgccc gcaaccgtcg agccccgccg ccccgggaag aggcgggccg gctgtggtcg 960 

atggtgaccg aggaggagcg gcccacgacg gccgcaggca ccaacctgca ccggctggtg 1020 

tgggagctcc gcgagcgtct ggcccggatg cggggcttct gggcccggct gtccctgacg 1080 

gtgtgcggag actctcgcat ggcagcggac gcctcgctgg aggcggcgcc ctgcUggacc 1140 

ggagccgggc ggggccggta cttgccgcca gtggtcgggg gctccccggc cgagcaggtc 1200 

aacaaccccg agctcaaggt ggacgcctcg ggccccgatg tcccgacacg gcggcgtcgg 12 60 

ctacagctcc gggcggccac ggc 12 83 



<210> 4 

<211> 2312 

<2-12> DNA 

<213> Homo sapiens 

<400> 4 



ccctgccccg 


cgccgccaag 


cggttccogc 


cctcgcccag 


cgcccaggta 


gctgcgagga 


60 


aacttttgca 


gcggctgggt 


agcagcacgt 


ctcttgctcc 


tcagggccac 


tgccaggctt 


120 


gccgagtcct 


gggactgctc 


tcgctccggc 


tgccactctc 


ccgcgctctc 


ctagctccct 


180 


gcgaagcagg 


atggccggga 


ccgtgcgcac 


cgcgtgcttg 


gtggtggcga 


tgctgctcag 


240 


cttggacttc 


ccgggacagg 


cgcagccccc 


gccgccgccg 


ccggacgcca 


cctgtcacca 


300 


agtccgctcc 


ttcttccaga 


gactgcagcc 


cggactcaag 


tgggtgccag 


aaactcccgt 


360 


gccaggatca 


gatttgcaag 


tatgtctccc 


taagggccca 


acatgctgct 


caagaaagat 


420 


ggaagaaaaa 


taccaactaa 


cagcacgatt 


gaacatggaa 


cagctgcttc 


agtctgcaag 


480 
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tatggagctc 


aagttcttaa 


ttattcagaa 


tgttgttcgc 


catgccaaga 


actacaccaa 


gactccacaa 


gcttttgagt 


ttgtgggtga 


gggttctgac 


atcaatgtag 


atgacafcggt 


catctatacc 


cagctaatga 


acccaggcct 


cctccgagga 


gcaagacgtg 


acctgaaagt 


ccaggtttcc 


aagtcactgc 


aagtcaetag 


tgaagtgatc 


aacacaactg 


atcacctgaa 


cagaatgtgg 


tactgctctt 


actgccaggg 


ctgcaatgtg 


gtcatgcaag 


gctgtatggc 


agaatacatt 


ctgtcccttg 


aagaacttgt 


gaacgtactg 


cttggtctct 


tttcaacaat 


tgcaggaaag 


ctgaccacca 


ctattggcaa 


tagatctgct 


tattatcctg 


aagatctctt 


tgtagaacat 


gaagaaacct 


tatccagccg 


tttcatcagc 


ttctatagtg 


ctttgccugg 


aaacgacacc 


ctttgctgga 


atggacaaga 


aaggaatgga 


atgaaaaacc 


agttcaatct 


agtggtcagt 


caaattattg 


acaaactgaa 


tatgcccaaa 


ggtagagttc 


tggataaaaa 


ctgcggtgat 


gatgaagatg 


agtgcattgg 


gaatcagctc 


cgcttccttg 


cagaactggc 


aaacagtcag 


caggcaactc 


cgaaggacaa 


cgttcattcc 


ccgctgaagc 


ttctcaccag 


cctggtgcac 


tgactgcctg 


gtgcccagca 


tcctcgataa 


agggaaccac 


tttcttattt 


atacctcctc 


cagccatgaa 


gtagaggact 


ggtatctttt 


ggaggaagat 


acattttagt 


aagaaaaaaa 


accatcaagt 


tgtgccaaat 


gttaccatgt 


ctttctctct 


cactccctcc 


ctttgaaaaa 


aaataaattg 


ctcaaataaa 
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tgctgcggtt 


ttccaagagg 


cctttgaaat 


540 


tgccatgttc 


aagaacaact 


acccaagcct 


600 


atttttcaca 


gatgtgtctc 


tctacatctt 


660 


caatgaattg 


tttgacagcc 


tgtttccagt 


720 


gcctgattca 


gccttggaca 


tcaatgagtg 


780 


atttgggaat 


ttccccaagc 


ttattatgac 


840 


gatcttcctt 


caggctctga 


atcttggaat 


900 


gttcagtaag 


gactgtggcc 


gaatgctcac 


960 


actgatgatg 


gttaaaccct 


gtggcggtta 


1020 


aggtgtggtg 


gagattgaca 


agtactggag 


1080 


gaatggcatg 


tacagaatct 


atgacatgga 


1140 


ccatgattct 


atccagtatg 


tccagaagaa 


1200 


gttatgtgcc 


cattctcaac 


aacgccaata 


1260 


tattgacaag 


aaagtattaa 


aagttgctca 


1320 


aagaagggaa 


ctaattcaga 


agttgaagtc 


13 80 


ctacatctgc 


agccatagcc 


ctgtggcgga 


1440 


actcgtggag 


agatacagcc 


aaaaggcagc 


1500 


cr.\'it.gat;c:ug 


aaaatgaagg 


gccctgagcc 


1560 


gcacattaac 


cagctcctga 


gaaccatgtc 


1620 


cctggatgag 


gaagggtttg 


aaagtggaga 


1680 


aggctctggt 


gatggaatga 


taaaagtgaa 


1740 


ctatgatctg 


gatgtggatg 


atgcgcctgg 


1800 


cgagataagc 


acctttcaca 


acctcgggaa 


1860 


catggccatc 


tcggtggtgt 


gcttcttctt 


1920 


catgtgctgc 


cctacagcac 


cctgtggtct 


1980 


ttttctattt 


tttttttttt 


gttatcctgt 


2040 


aaccatgtgt 


tatgttttcg 


aaaatcaaat 


2100 


ggtagcatat 


agattgtcct 


tttgcaaaga 


2160 


tattctccta 


tgtttggctg 


ctagaacatg 


2220 


ctttctatcg 


ttctctcttt 


gcatggattt 


2280 


aa 






2312 
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<210> 5 

<2U> 3714 

<212> DNA 

<213> Homo sapiens 

<400> 5 

gcctggcacc ggggaccgtt gcctgacgcg aggcccagct ctacttttcg ccccgcgtct 60 

-e&fceegeg-feg ctcgcotctb— sgaccaactc- caactccttc tccctccage tcca.cfcc.gc.fc. 12- Q. 

agtccccgac tccgccagcc ctcggcccgc tgccgtagcg ccgcttcccg tccggtccca 180 

aaggtgggaa cgcgtccgcc ccggcccgca ccatggcacg gttcggcttg cccgcgcttc 240 

tctgcaccct ggcagtgctc agcgccgcgc tgctggctgc cgagctcaag tcgaaaagtt 300 

geteggaagt gegaegtett tacgtgtcca aaggcttcaa caagaacgat gcccccctcc 3 60 

acgagatcaa eggtgatcat ttgaagatct gtccccaggg ttctacctgc tgctctcaag 42 0 

agatggagga gaagtacagc ctgcaaagta aagatgattt caaaagtgtg gtcagegaac 480 

agtgcaatca tttgeaaget gtctttgett cacgttacaa gaagtttgat gaattcttca 540 

aagaactact tgaaaatgea gagaaatccc tgaatgatat gtttgtgaag acatatggee 600 

atttatacat gcaaaattct gagctattta aagatctctt cgtagagttg aaacgttact 660 

acgtggtggg aaatgtgaac ctggaagaaa tgctaaatga cttctgggct cgcctcctgg 720 

agcggatgtt ccgcctggtg aactcccagt. accactttac agatgagtat ctggaatgtg 780 

tgagcaagta tacggagcag ctgaagccct teggagatgt ccctcgcaaa ttgaagctcc 840 

aggttactcg tgcttttgta gcagcccgta ctttcgctca aggcttagcg gttgegggag 900 

atgtcgtgag caaggtctcc gtggtaaacc ccacagccca gtgtacccat gccctgttga 960 

agatgatcta ctgctcccac tgccggggtc tcgtgactgt gaagccatgt tacaactact 1020 

gctcaaacat catgagaggc tgtttggcca accaagggga tctcgatttt gaatggaaca 1080 

atttcataga tgctatgctg atggtggcag agaggctaga gggtcctttc aacattgaat 1140 

cggtcatgga tcccatcgat gtgaagattt ctgatgetat tatgaacatg caggataata 1200 

gtgttcaagt gtctcagaag gttttccagg gatgtggacc ccccaagccc ctcccagctg 1260 

gacgaatttc tcgttccatc tctgaaagtg ccttcagtgc tegcttcaga ccacatcacc 132 0 

ccgaggaacg cccaaccaca gcagctggca ctagtttgga ccgactggtt actgatgtca 1380 

aggagaaact gaaacaggee aagaaattct ggtcctccct tccgagcaac gtttgcaacg 1440 

atgagaggat ggctgcagga aacggcaatg aggatgactg ttggaatggg aaaggcaaaa 1500 

gcaggtacct gtttgcagtg acaggaaatg gattagecaa ccagggcaac aacccagagg 1560 

tccaggttga caccagcaaa ccagacatac tgatccttcg tcaaatcatg gctcttcgag 1620 

tgatgaccag caagatgaag aatgeataca atgggaacga cgtggacttc tttgatatca 1680 
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gtgatgaaag tagtggagaa ggaagtggaa gtggctgtga gtatcagcag tgcccttcag 1740 

agtttgacta caatgccact gaccatgctg ggaagagtgc caatgagaaa gccgacagtg 1800 

ctggtgtccg tcctggggca caggcctacc tcctcactgt cttctgcatc ttgttcctgg 1860 

ttatgcagag agagtggaga taattctcaa actctgagaa aaagtgttca tcaaaaagtt 192 0 

-aaaagge&os— agfe-faa tcact ttfc c ta c cat cct a gfcgacfc ttgcttttta aatgaatgga 19S.Q. 

caacaatgta cagtttttac tatgtggcca ctggtttaag aagtgctgac tttgttttct 2040 

cattcagttt tgggaggaaa agggactgtg cattgagttg gttcctgctc ccccaaacca 2100 

tgttaaacgt ggctaacagt gtaggtacag aactatagtt agttgtgcat ttgtgatttt 2160 

atcactctat tatttgtttg tatgtttttt tctcatttcg tttgtgggtt tttttttcca 2220 

acfcgtgatct cgccttgttt cttacaagca aaccagggtc ccttcttggc acgtaacatg 2280 

tacgtatttc tgaaatafcta aatagctgta cagaagcagg ttttatttat catgttatct 2340 

tattaaaaga aaaagcccaa aaagcagtaa aatttccatt tctccctgtt attttagttg 2400 

ccttatctgg agagacgtgg aggtgatttt ctttttttta aattattatt aagacagaat 2460 

gtgagggcac aagcaggctt ctgagccact tgtcagattg tattcaaagc atcaatccaa 2520 

gaaggaggtt atgtgtactt catttattgg tgatagttgg aagagactgc agactactgc 2580 

tttgaatgag ttgaattaca taagctaaga tcactatagg tccatttctt gaacccactt 2640 

atacataaaa tgtaacccat ttagaaaaag attctggata tcatccccct tgaaagatag 2700 

aaagcattca ggatgtccca gttatcacat gttcacactt gggtttaggg gtgttttttt 27 60 

ttaaaaccag gcaggttagc tagcccaccc tgtgctagtt ttcatgttca cactgaccct 2820 

atttgaatta atatcctttg ttagagtggt cgagatttca aacccaatta tgtacaggga 2880 

gctgtctgag agctagccag aactggggta cagcctgggc tcagggaata gctgtcaaca 2940 

ctcgggcaaa gtttttgtct gtgcatgtgt atctccattt gttttgggat cccagttttt 3000 

gttttaagag agtataaggt gtctcatttg agtctttttc ttacctagcc ccctcttatc 3060 

agtaaaacaa aggacttgcc atggttcaca gcaatgtgct acgatccaag atatcagcca 312 0 

aggagcccac ttaggggaga actaggtgtc cagatttttg tatgtgttgt ttttcttggg 3180 

ggatggggtg gggtgggagt aggtagagct gagaatacta catcttagtg gtgaccttta 3240 

gccacgtggg tgaagtggca aaggccatgg ccatatctgt tgtcccaggc caaagactaa 3300 

caactgcctt gggaatccct tccttgtgtc cttaccaaat gatagctcat aaaactctga 33 60 

taatgtaaca aatcactfctc aaaggagttc ccagaagtct tcagaaagac taaaattctg 342 0 

tctcttcctg ctttagacag ccattaagat cccaactaat tttaccgaac ctaaaaccca 3480 

caaagaggtt gtttgtgtta ttgttcaatc ttcagttgta agagtaattc tctattttta 3540 
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tattgaaaca taattacttg atagctcagg gtctacattt cattcaactt tttacaccaa 3600 
attctgcaga gtggtcaaaa tggaatattg ggggctgttg taaacagagg cttaatttta 3660 
ttagaagtag ccagttattt attaaagcat gatgttaata aaataggcat attc 3714 

<210> 6 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> m±sc_f eature 

<222> (2877) .. (2877) 

<223> "n" is A, C, G, or T 

<220> 

<221> misc_f eature 

<222> (3380) .. (3380) 

<223> "n" is A, C, G,. or T 

<400> 6 



gcctggcacc 


ggggaccgtt 


gcctgacgcg 


aggcccagct 


ctacttttcg 


ccccgcgtct 


60 


cctccgcctg 


ctcgcctctt 


ccaccaactc 


caactccttc 


tccctccagc 


tccactcgct 


120 


agtccccgac 


tccgccagcc 


ctcggoccgc 


tgccgtagcg 


ccgcttcccg 


tccggtccca 


180 


aaggtgggaa 


cgtgtccgcc 


ccggcccgca 


ccatggoacg 


gttcggcttg 


cccgcgcttc 


240 


tctgcaccct 


ggcagtgctc 


agcgccgcgc 


tgctggctgc 


cgagctcaag 


tcgaaaagtt 


300 


gctcggaagt 


gcgacgtctt 


tacgtgtcca 


aaggcttcaa 


caagaacgat 


gcccccctcc 


360 


acgagatcaa 


cggtgatcat 


ttgaagatct 


gtccccaggg 


ttctacctgc 


tgctctcaag 


420 


agatggagga 


gaagtacagc 


ctgcaaagta 


aagatgattt 


caaaagtgtg 


gtcagcgaac 


480 


agtgcaatca 


tttgcaagct 


gtctttgctt 


cacgttacaa 


gaagtttgat 


gaattcttca 


540 


aagaactact 


tgaaaatgca 


gagaaatccc 


tgaatgatat 


gtttgtgaag 


acatatggcc 


600 


atttatacat 


gcaaaattct 


gagctattta 


aagatctctt 


cgtagagttg 


aaacgttact 


660 


acgtggtggg 


aaatgtgaac 


ctggaagaaa 


tgctaaatga 


cttctgggct 


cgcctcctgg 


720 


agcggatgtt 


ccgccfcggtg 


aactcccagt 


accactttac 


agatgagtat 


ctggaatgtg 


780 


tgagcaagta 


tacggagcag 


ctgaagccct 


tcggagatgt 


ccctcgcaaa 


ttgaagctcc 


840 


aggttactcg 


tgcttttgta 


gcagcccgta 


ctttcgctca 


aggcttagcg 


gttgcgggag 


900 


atgtcgtgag 


caaggtctcc 


gtggtaaacc 


ccacagccca 


gtgtacccat 


gccctgttga 


960 


agatgatcta 


ctgctcccac 


tgccggggtc 


tcgtgactgt 


gaagccatgt 


tacaactact 


1020 


gctcaaacat 


catgagaggc 


tgtttggcca 


accaagggga 


tctcgatttt 


gaatggaaca 


1080 
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atttcataga tgctatgctg atggtggcag agaggctaga gggtcctttc aacattgaat 1140 

cggtcatgga tcccatcgat gtgaagattt ctgatgctat tatgaacatg caggataata 1200 

gtgttcaagt gtctcagaag gttttccagg gatgtggacc ccccaagccc ctcccagctg 1260 

gacgaatttc tcgttccatc tctgaaagtg ccttcagtgc tcgcttcaga ccacatcacc 1320 

ccgaggaacg cccaaccaca gcagctggca ctagtttgga ccgactggtt actgatgtca 1380 

aggataaact gaaacaggcc aagaaattct ggtcctccct tccgagcaac gtttgcaacg 1440 

atgagaggat ggctgcagga aacggcaatg aggatgactg ttggaatggg aaaggcaaaa 1500 

gcaggtacct gtttgcagtg acaggaaatg gattagtcaa ccagggcaac aacccagagg 1560 

tccaggttga caccagcaaa ccagacatac tgatccttcg tcaaatcatg gctcttcgag 162 0 

tgatgaccag caagatgaag aatgcataca atgggaacga cgtggacttc tttgatatca 1680 

gtgatgaaag tagtggagaa ggaagtggaa gtggctgtga gtatcagcag tgcccttcag 1740 

agtttgacta caatgccact gaccatgctg ggaagagtgc caatgagaaa gccgacagtg 1800 

ctggtgtccg tcctggggca caggcctacc tcctcactgt cttctgcatc ttgttcctgg 1860 

ttatgcagag agagtggaga taattctcaa actctgagaa aaagtgttca tcaaaaagtt 1920 

aaaaggcacc agttatcact tttctaccat cctagtgact ttgcttttta aatgaatgga 1980 

caacaatgta cagtttttac tatgtggcca ctggtttaag aagtgctgac tttgttttct 2040 

cattcagttt tgggaggaaa agggactgtg cattgagtfcg gttcctgctc ccccaaacca 2100 

tgttaaacgt ggctaacagt gtaggtacag aactatagtt agttgtgcat ttgtgatttt 2160 

atcactctat tatttgtttg tatgtttttt tctcatttcg tttgtgggtt tttttttcca 2220 

actgtgatct cgccttgttt cttacaagca aaccagggtc ccttcttggc acgtaacatg 2280 

tacgtatttc tgaaatatta aatagctgta cagaagcagg ttttatttat catgttatct 2340 

tattaaaaga aaaagcccaa aaagcagtaa aatttccatt tctccctgtt attttagttg 2400 

ccttatctgg agagacgtgg aggtgatttt ctttttttaa attattatta agacagaatg 2460 

tgaaggcaca agcaggcttc tgagccactt gtcagattgt attcaaagca tcaatccaag 2 520 

aggaggttat gtgtacttca tttattggtg atagttggaa gagactgcag actactgctt 2580 

tgaatgagtt gaattacata agctaagatc actataaggt ccatttcttg aacccactta 2 640 

tacataaaat gtaacccatt tagaaaaaga ttctggatat catcccccct tgaaagatag 2700 

aaagcattca ggatgtccca gttatcacat gttcacactt gggtttaggg gtgttttttt 2760 

ttaaaaccag gcaggttagc tagcccaccc tgtgctagtt ttcatgttca cgctgaccct 2 820 

atttgaatta atatcctttg ttagagtggt cgagatttca aacccaatta tgtacangga 2880 

gctgtctgag agctagccag aactggggta cagcctgggc tcagggaata gctgtcaaca 2940 
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ctcgggcaaa gtttttgtct gcagcaacgt gtatcaccat ttgttttggg atccagtttt 3000 

tgttttaaga gagtataagg tgtctcattt gagtcttttt cttacctagc cccctcttat 3060 

cagtaaaaca aaggacttgc catggttcac agcaatgtgc tacgatccaa gatatcaacc 3120 

aaggagccca cttaggggag aactaggtgt ccagattttt gtatgtgttg tttttcttgg 3180 

gggatggggt ggggtgggag taggtagagc tgagaatact acatcttagt ggtgaccttt 3240 

agccacgtgg gtgaagtggc aaaggccatg gccatatctg ttgtcccagg ccaaagacta 3300 

acaactgcct tgggaatccc ttccttgtgt ccttaccaaa tgatagctca taaaactctg 3 360 

ataatgtaac aaatcacttn caaaggagtt cccagaagtc ttcagaaaga ctaaaattct 342 0 

gtctcttcct gctfctagaca gccattaaga tcccaactaa ttttaccgaa cctaaaaccc 3480 

acaaagaggt tgtttgtgtt attgttcaat cttcagttgt aagagtaatt ctctattttt 3540 

atattgaaac ataattactt gatagctcag ggtctacact tcattcaact ttttacacca 3 600 

aattctgcag agtggtcaaa atggaatatt gggggctgtt gtaaacagag gcttaatttt 3 660 

attagaagta gccagttatt tattaaagca tgatgttaat aaaataggca tattccaaaa 3720 

aaaa 3724 

<210> 7 

<211> 1722 

<212> DNA 

<213> Homo sapiens 

<400> 7 



accatggcac 


ggttcggctt 


gcccgcgctt 


ctctgcaccc 


tggcagtgct 


cagcgccgcg 


60 


ctgctggctg 


ccgagctcaa 


gtcgaaaagt 


tgctcggaag 


tgcgacgtct 


ttacgtgtcc 


12 0 


aaaggcttca 


acaagaacga 


tgcccccctc 


cacgagatca 


acggtgatca 


tttgaagatc 


180 


tgtccccagg 


gttctacctg 


ctgctctcaa 


gagatggagg 


agaagtacag 


cctgcaaagt 


240 


aaagatgatt 


tcaaaagtgt 


ggtcagcgaa 


cagtgcaatc 


atttgcaagc 


tgtctttgct 


300 


tcacgttaca 


agaagtttga 


tgaattcttc 


aaagaactac 


ttgaaaatgc 


agagaaatcc 


360 


ctgaatgata 


tgtttgtgaa 


gacata tggc 


catttataca 


tgcaaaattc 


tgagctattt 


420 


aaagatctct 


tcgtagagtt; 


gaaacgtitac 


tacgtggtgg 


gaaatgtgaa 


cctggaagaa 


480 


atgctaaatg 


acttctgggc 


tcgcctcctg 


gagcggatgt 


tccgcctggt 


gaactcccag 


540 


taccacttta 


cagatgagta 


tctggaatgt 


gtgagcaagt 


atacggagca 


gctgaagccc 


600 


ttcggagatg 


tccctcgcaa 


attgaagctc 


caggttactc 


gtgcttttgt 


agcagcccgt 


660 


actttcgctc 


aaggcttagc 


ggttgcggga 


gatgtcgtga 


gcaaggtctc 


cgtggtaaac 


720 


cccacagccc 


agtgtaccca 


tgccctgttg 


aagatgatct 


actgctccca 


ctgccggggt 


780 


ctcgtgactg 


tgaagccatg 


ttacaactac 


tgctcaaaca 


tcatgagagg 


ctgtttggcc 


840 
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aaccaagggg atctcgattt tgaatggaac aatttcatag atgctatgct gatggtggca 900 
gagaggctag agggtccttt caacattgaa tcggtcatgg atcccatcga tgtgaagatt 960 
tctgatgcta ttatgaacat gcaggataat agtgttcaag tgtctcagaa ggttttccag 1020 
ggatgtggac cccccaagcc cctcccagct ggacgaattt ctcgttccat ctctgaaagt 1080 

- gcetteag-hg - -efeege-fcfeeag aocaaatcae cocgaggaao goeaaaccaG agaagc-fegge .1.14-0-- 



actagtttgg accgactggt tactgatgtc aaggagaaac tgaaacaggc caagaaattc 1200 

tggtcctccc ttccgagcaa cgtttgcaac gatgagagga tggctgcagg aaacggcaat 1260 

gaggatgact gttggaatgg gaaaggcaaa agcaggtacc tgtttgcagt gacaggaaat 1320 

ggattagcca accagggcaa caacccagag gtccaggttg acaccagcaa accagacata 1380 

ctgatccttc gtcaaatcat ggctcttcga gtgatgacca gcaagatgaa gaatgcatac 1440 

aatgggaacg acgtggactt ctttgatatc agtgatgaaa gtagtggaga aggaagtgga 1500 

agtggctgtg agtatcagca gtgcccttca gagtttgact acaatgccac tgaccatgct 1560 

gggaagagtg ccaatgagaa agccgacagt gctggtgtcc gtcctggggc acaggcctac 1620 

ctcctcactg tcttctgcat cttgttcctg gttatgcaga gagagtggag ataattctca 1680 

aactctgaga aaaagtgttc atcaaaaagt taaaaggcac ca 1722 



<210> 8 

<211> 2625 

<212> DNA 

<213> Homo sapiens 



<400> 8 



gtcttccacg 


Uctgcagctc 


agccagggcg 


cgcagggcga 


gtggggtcca 


ctggcgggta 


60 


aaggggacca 


ggacggcgag 


gatggacgca 


cagacctggc 


ccgtgggctt 


tcgctgcctc 


120 


ctccttctgg 


ccctggttgg 


gtccgcccgc 


agcgagggcg 


tgcagacctg 


cgaagaagtt 


180 


cggaaacttt 


tccagtggcg 


gctgctggga 


gctgtcaggg 


ggctgccgga 


ttcgccgcgg 


240 


gcaggacctg 


atcttcaggt 


ttgcatatcc 


aaaaagccta 


catgttgcac 


caggaagatg 


300 


gaggagagat 


atcagattgc 


ggctcgccag 


gatatgcagc 


agtttcttca 


aacgtccagc 


360 


tctacattaa 


agtttctaat 


atctcgaaat 


gcggctgctt 


ttcaagaaac 


ccttgaaact 


420 


ctcatcaaac 


aagcagaaaa 


ttacaccagt 


atactttttt 


gcagtaccta caggaacatg 


480 


gccttggagg 


ctgctgcttc 


ggttcaggag 


ttcttcactg 


atgtggggct 


gtatttattt 


540 


ggtgcggatg 


ttaatcctga 


agaatttgta 


aacagatttt 


ttgacagtct 


ttttcctctg 


600 


gtctacaacc 


acctcattaa 


ccctggtgtg 


actgacagtt 


ccctggaata 


ctcagaatgc 


660 


atccggatgg 


ctcgccggga 


tgtgagtcca 


tttggtaata 


ttccccaaag 


agtaatggga 


720 
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cagatgggga 


ggtccctgct 


gcccagccgc 


acttttctgc 


aggcactcaa 


tctgggcatt 


7 80 


gaagtcatca 


acaccacaga 


ctatctgcac 


ttctccaaag 


agtgcagcag 


agccctcctg 


840 


aagatgcaat 


actgcccgca 


ctgccaaggc 


ctggcgctca 


ctaagccttg 


tatgggatac 


900 


tgcctcaatg 


tcatgcgagg 


ctgcctggcg 


cacatggcgg 


agcttaatcc 


acactggcat 


960 


gcatatatcc 


ggtcgttgga 


agaactctcg 


gatgcaatgc 


atggaacata 


cgacattgga 


1020 


cacgtgctgc 


tgaactttca 


cttgcttgtt 


aatgatgctg 


tgttacaggc 


tcacctcaat 


1080 


ggacaaaaat 


tattggaaca 


ggtaaatagg 


atttgtggcc 


gccctgtaag 


aacacccaca 


1140 


caaagccccc 


gttgttcttt 


tgatcagagc 


aaagagaagc 


atggaatgaa 


gaccaccaca 


1200 


aggaacsgtg 


aagagacgct 


tgccaacaga 


agaaaagaat 


ttatcaacag 


ccttcgactg 


1260 


tacaggtcat 


tctatggagg 


tctagctgat 


cagctttgtg 


ctaatgaatt 


agctgctgca 


1320 


gatggacttc 


cctgctggaa 


tggagaagat 


atagtaaaaa 


gttatactca 


gcgtgtggtt 


1380 


ggaaatggaa 


tcaaagccca 


gtctggaaat 


cctgaagtca 


aagtcaaagg 


aattgatcct 


1440 


gtgataaatc 


agattattga 


taaactgaag 


catgttgttc 


agttgttaca 


gggtagatca 


1500 


cccaaacctg 


acaagtggga 


acttcttcag 


ctgggcagtg 


gtggaggcat 


ggttgaacaa 


1560 


gtcagtgggg 


actgtgatga 


tigaagatggt 


tgcgggggat 


caggaagtgg 


agaagtcaag 


1620 


aggacactga 


agatcacaga 


ctggatgcca 


gatgatatga 


acttcagtga 


tgtaaagcaa 


1680 


atccatcaaa 


cagacactgg 


cagtacttta 


gacacaacag 


gagcaggatg 


tgcagtggcg 


174 0 


actgaatcta 


tgacattcac 


tctgataagt 


gtggtgatgt 


tacttcccgg 


gatttggtaa 


1800 


ctgaactctt 


ctgtcctgac 


ataccttact 


gaagtctcga 


tttcttctct 


ctctgcatat 


1860 


gcctggaata 


agagatcctt 


tttcaatgta 


acaattatat 


ttatgaaaag 


atatgttaca 


1920 


ctaacttctc 


agaagccaag 


ctgaaatatt 


cataaagtcc 


ctaaaactca 


acgtttaaat 


1980 


gacacacfctt 


aaaaatatgt 


cttttttcaa 


tctaactgaa 


aaccttctta 


acttctaata 


2040 


tattaaatct 


gaagatgtga 


agggcacaga 


agtgactttg 


aataagaaga 


atttagtgta 


2100 


tctgtaattt 


tattatcaat 


tccaagcccc 


ttcctttcta 


aattaaaaat 


gttttcattt 


2160 


gaaagtgtat 


ttgccagaca 


atgaaaacag 


tatgcagtat 


ttcttaaagt 


attgaaatta 


2220 


gaatatcatg 


aaataaatca 


aaacatacaa 


tggcaagtag 


tatgcatgca 


tattcaagag 


2280 


actcttccat 


ttttgcaagc 


tgtagaagga 


aatgtctgaa 


tgtctataag 


ttatggggta 


2340 


gattcttgag 


aagcatttca 


tataatttca 


ctgaagaacc 


ttgataattt 


tgacccactg 


2400 


taacttagcc 


actgatgaac 


cttaaagctg 


agtattttat 


taacacctga 


tttgtattct 


2460 


attatattca 


aaatgcatct 


ttggtattgt 


gcctctgctc 


ccatctctct 


ctttgcctca 


2520 


tagatttagc 


tatgttggga 


agcacatgct 


tgctctagga 


atatctccaa 


taaagctgtt 


2580 
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2625 



<210> 9 

<211> 1938 

<212> DNA 

<213> Homo sapiens 

<400> 9 

grcTCccaeg tcrtrgcagntc agccagggcg cgcagggcga gLygygLcca ctggcgggta 66 

aaggggacca ggacggcgag gatggacgca cagacctggc ccgtgggctt tcgctgcctc 12 0 

ctccttctgg ccctggttgg gtccgcccgc agcgagggcg tgcagacctg cgaagaagtt 180 

cggaaacttt tccagtggcg gctgctggga gctgtcaggg ggctgccgga ttcgccgcgg 240 

gcaggacctg atcttcaggt ttgcatatcc aaaaagccta catgttgcac caggaagatg 300 

gaggagagat atcagattgc ggctcgccag gatatgcagc agtttcttca aacgtccagc 3 60 

fcctacattaa agtttctaat atctcgaaat gcggctgctt ttcaagaaac ccttgaaact 420 



LLLa U L ei el ta. U 


ci ciy t~ ciy cieictct 


I- L-dv^d' — v— CLy i— 


afactttttt 


gcagtaccta 


caggaacatg 


480 


gc ecu yyayy 




fjfrf" {- f rifi^i f-r 

yy i-i-^ a yy a y 


\— ll. 1 — >-i — 




gtatttattt 


540 


gg ugeggaug 


LLaGLLLLUci 


ny ua L L Ly L.CJ. 




t~ 1" rr^i on crt" c t~ 


ttttcctctg 


600 


gfccfcacaacc 


accticattaa 


c uc L-Cj y uy Lg 




r* ("* t~ rrrr;3 3 t~ ;^ 
lll tyy na lu 


{"* t~ p ^ rr^ ^ t* err* 


660 


ateeggatgg 


ctcgccggga 


tgtgagtcca 


ttttgtaata 


ttccccaaag 


agtaatggga 


720 


cagatgggga 


ggtccctgct 


gcccagccgc 


acttttctgc 


aggcactcaa 


tctgggcatt 


780 


gaagtcatca 


acaccacaga 


ctatctgcac 


ttcttcaaag 


agtgcagcag 


agccctcctg 


840 


aagatgcaat 


actgcccgca 


ctgccaaggc 


ctggcgctca 


etaagecttg 


tatgggatac 


900 


tgcctcaatg 


teatgegagg 


ctgcctggcg 


cacatggegg 


agcttaatcc 


acactggcat 


960 


gcatatatcc 


ggtcgttgga 


agaactctcg 


gatgeaatge 


atggaaoata 


cgacattgga 


1020 


cacgtgctgc 


tgaactttca 


cttgcttgtt 


aatgatgctg 


tgttacaggc 


tcacctcaat 


1080 


ggacaaaaat 


tattggaaca 


ggtaaatagg 


atttgtggcc 


gccctgtaag 


aacacccaca 


1140 


caaagccccc 


gttgttcttt 


tgatcagagc 


aaagagaagc 


atggaatgaa 


gaccaccaca 


1200 


aggaacagtg 


aagagacget 


tgecaacaga 


agaaaagaat 


ttatcaacag 


ccttcgactg 


1260 


tacaggtcat 


tctatggagg 


tctagctgat 


cagctttgtg 


ctaatgaatt 


agetgetgea 


1320 


gatggacttc 


cctgctggaa 


tggagaagat 


atagtaaaaa 


gttatactca 


gcgtgtggtt 


1380 


ggaaatggaa 


tcaaagccca 


gtctggaaat 


cctgaagtca 


aagtcaaagg 


aattgatcct 


1440 


gtgataaatc 


agattattga 


taaactgaag 


catgttgttc 


agttgttaca 


gggtagatca 


1500 


cccaaacctg 


acaagtggga 


acttcttcag 


ctgggcagtg 


gtggaggcat 


ggttgaacaa 


1560 


gtcagtgggg 


actgtgatga 


tgaagatggt 


tgegggggat 


caggaagtgg 


agaagtcaag 


1620 
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aggacactga agatcacaga ctggatgcca gatgatatga acttcagtga tgtaaagcaa 1680 
atccatcaaa cagacactgg cagtacttta gacacaacag gagcaggatg tgcagtggcg 1740 
actgaatcta tgacattcac tctgataagt gtggtgatgt tacttcccgg gatttggtaa 1800 
ctgaactctt ctgtcctgac ataccttact gaagtctcga tttcttctct ctctgcatat 1860 
ccctggaa c a agagatcc tt LUtuaaLuta acaattafcafc ttatga aaag a-t-at-gt fcaea 492£~ 



ctaacttcca gaagccaa 



<210> 10 

<211> 2644 

<212> UNA 

<213> Homo sapiens 

<400> 10 



1938 



agcggccgct 


gaattctagc 


caggatttct 


ctcttcctat 


ttcaggagga 


ctctcacagg 


60 


ctcccacagc 


ctgtgttaag 


ctgaggtttc 


ccctagatct 


cgtatatccc 


caacacatac 


120 


ctccacgcac 


acacatcccc 


aagaacctcg 


agctcacacc 


aacagacaca 


cgcgcgcata 


180 


cacactcgct 


ctcgcttgtc 


catctccctc 


ccgggggagc 


cggcgcgcgc 


tcccaccttt 


240 


gccgcacact 


ccggcgagcc 


gagcccgcag 


cgctccagga 


ttctgcggct 


cggaactcgg 


300 


attgcagctc 


tgaaccccca 


tggtggtttt 


ttaaacactt 


cttttccttc 


tcttcctcgt 


360 


tttgattgca 


ccgtttccat 


ctgggggcta 


gaggagcaag 


gcagcagcct 


tcccagccag 


420 


cccttgttgg 


cttgccatcg 


tccatctggc 


ttataaaagt 


ttgctgagcg 


cagtccagag 


480 


ggctgcgctg 


ctcgtcccct 


cggctggcag 


aagggggtga 


cgctgggcag 


cggcgaggag 


540 


cgcgccgctg 


cctctggcgg 


gctttcggct 


tgaggggcaa 


ggtgaagagc 


gcaccggoog 


600 


tggggtttac 


cgagctggat 


ttgtatgttg 


caccatgcct 


tcttggatcg 


gggctgtgat 


660 


tcttcccctc 


ttggggctgc 


tgctctccct 


ccccgccggg 


gcggatgtga 


aggctcggag 


720 


ctgcggagag 


gtccgccagg 


cgtacggtgc 


caagggattc 


agcctggcgg 


acatccccta 


780 


ccaggagatc 


gcaggggaac 


acttaagaat 


ctgtcctcag 


gaatatacat 


gctgcaccac 


840 


agaaatggaa 


gacaagttaa 


gccaacaaag 


caaactcgaa 


tttgaaaacc 


ttgtggaaga 


900 


gacaagcoat 


tttgtgcgca 


ccacttttgt 


gtccaggcat 


aagaaatttg 


acgaattttt 


960 


ccgagagctc 


ctggagaatg 


cagaaaagtc 


actaaatgat 


atgtttgtac 


ggacctatgg 


1020 


catgctgtac 


atgcagaatt 


cagaagtctt 


ccaggacctc 


ttcacagagc 


tgaaaaggta 


1080 


ctacactggg 


ggtaatgtga 


atctggagga 


aatgctcaat 


gacttttggg 


ctcggctcct 


1140 


ggaacggatg 


tttcagctga 


taaaccctca 


gtatcacttc 


agtgaagact 


acctggaatg 


1200 


tgtgagcaaa 


tacactgacc 


agctcaagcc 


atttggagac 


gtgccccgga 


aactgaagat 


1260 
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tcaggttacc 


cgcgccttca 


ttgctgccag 


gacctttgtc 


caggggctga 


ctgtgggcag 


1320 


agaagttgca 


aaccgagttt 


ccaaggtcag 


cccaacccca 


gggtgtatcc 


gtgccctcat 


1380 


gaagatgctg 


tactgcccat 


actgtcgggg 


gcttcccact 


gtgaggccct 


gcaacaacta 


1440 


ctgfcctcaac 


gtcatgaagg 


gctgcttggc 


aaatcaggct 


gacctcgaca 


cagagtggaa 


1500 


tctgtttata 


gatgcaatgc 


tcttggtggc 


agagcgactg 


gaggggccat 


tcaacattga 


1560 


gtcggtcatg 


gacccgatag 


atgtcaagat 


ttctgaagcc 


attatgaaca 


tgcaagaaaa 


1620 


cagcatgcag 


gtgtctgcaa 


aggtctttca 


gggatgtggt 


cagcccaaac 


ctgctccagc 


1680 


cctcagatc'c 


gcccgctcag 


ctcctgaaaa 


ttttaataca 


cgtttcaggc 


cctacaatcc 


1740 


tgaggaaaga 


ccaacaactg 


ctgcaggcac 


aagcttggac 


cggctggtca 


cagacataaa 


1800 


agagaaattg 


aagctctcta 


aaaaggtctg 


gtcagcatta 


ccctacacta 


tctgcaagga 


1860 


cgagagcgtg 


acagcgggca 


cgtccaacga 


ggaggaatgc 


tggaacgggc 


acagcaaagc 


1920 


cagatacttg 


cctgagatca 


tgaatgatgg 


gctcaccaac 


cagatcaata 


atcccgaggt 


1980 


ggatgtggac 


atcactcggc 


ctgacacttt 


catcagacag 


cagattatgg 


ctctccgtgt 


2040 


gatgaccaac 


aaactaaaaa 


acgcctacaa 


tggcaatgat 


gtcaatttcc 


aggacacaag 


2100 


tgatgaatcc 


agtggctcag 


ggagtggcag 


tgggtgcatg 


gatgacgtgt 


gtcccacgga 


2160 


gtktgagttt 


gtcaccacag 


aggcccccgc 


agtggatccc 


gaccggagag 


aggtggactc 


2220 


ttctgcagcc 


cagcgtggcc 


actccctgct 


ctcctggtct 


ctcacctgca 


ttgtcctggc 


2280 


actgcagaga 


ctgtgcagat 


aatcttgggt 


ttttggtcag 


atgaaactgc 


attttagcta 


2340 


tctgaatggc 


caactcactt 


cttttcttac 


actcttggac 


aatggaccat 


gccacaaaaa 


2400 


cttaccgttt 


tctatgagaa 


gagagcagta 


atgcaatctg 


cctccctttt 


tgttttccca 


2460 


aagagtaccg 


ggtgccagac 


tgaactgctt 


cctctttcct 


tcagctatct 


gtggggacct 


2520 


tgtttattct 


agagagaatt 


cttactcaaa 


tttttcgtac 


caggagattt 


tcttaccttc 


2580 


atttgctttt 


atgctgcaga 


agtaaaggaa 


tctcacgttg 


tgagggtttt 


tttttttctc 


2640 


attt 












2644 


<210> 11 

<211> 2760 

<212> DNA 

< 2 1 3 > Homo s ap i ens 












<400> 11 
ccaggatttc 


tctcttccta 


tttcaggagg 


actctcacag 


gctcccacag 


cctgtgttaa 


60 


gctgaggttt 


cccctagatc 


tcgtatatcc 


ccaacacata 


cctccacgca 


cacacatccc 


120 


caagaacctc 


gagctcacac 


caacagacac 


acgcgcgcat 


acacactcgc 


tctcgcttgt 


180 


ccatctccct 


cccgggggag 


ccggcgcgcg 


ctcccacctt 


tgccgcacac 


tccggcgagc 


240 
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cgagcccgca gcgctccagg attctgcggc tcggaactcg gattgcagct ctgaaccccc 300 

atggtggttt tttaaacact tcttttcctt ctcttcctcg ttttgattgc accgtttcca 360 

tctgggggct agaggagcaa ggcagcagcc ttcccagcca gcccttgttg gcttgccatc 420 

gtccatctgg cttataaaag tttgctgagc gcagtccaga gggctgcgct gctcgtcocc 480 

tc g gcfcggea - gaagggggtg ao g cfegggc a- goggc-gagga gogcgccgct- g cct c tggcg 540. 

ggctttcggc ttgaggggca aggtgaagag cgcaccggcc gtggggttta ccgagctgga 600 

tttgtatgtt gcaccatgcc ttcttggatc ggggctgtga ttcttcccct cftggggctg 660 

ctgctctccc tccccgccgg ggcggatgtg aaggctcgga gctgcggaga ggtccgccag 720 

gcgtacggtg ccaagggatt cagcctggcg gacatcccct accaggagat cgcaggggaa 7 80 

cacttaagaa tctgtcctca ggaatataca tgctgcacca cagaaatgga agacaagtta 840 

agccaacaaa gcaaactcga atttgaaaac cttgtggaag agacaagcca ttttgtgcgc 900 

accacttttg tgtccaggca taagaaattt gacgaatttt tccgagagct cctggagaat 960 

gcagaaaagt cactaaatga tatgtttgta cggacctatg gcatgctgta catgcagaat 1020 

tcagaagtct tccaggacct cttcacagag ctgaaaaggt actacactgg gggtaatgtg 1080 

aatctggagg aaatgctcaa tgacttttgg gctcggctcc tggaacggat gtttcagctg 1140 

ataaaccctc agtatcactt cagtgaagac tacctggaat gtgtgagcaa atacactgac 1200 

cagctcaagc catttggaga cgtgccccgg aaactgaaga ttcaggttac ccgcgccttc 1260 

attgctgcca ggacctttgt ccaggggctg actgtgggca gagaagttgc aaaccgagtt 1320 

tccaaggtca gcccaacccc agggtgtatc cgtgccctca tgaagatgct gtactgccca 1380 

tactgtcggg ggcttcccac tgtgaggccc tgcaacaact actgtctcaa cgtcatgaag 1440 

ggctgcttgg caaatcaggc tgacctcgac acagagtgga atctgtttat agatgcaatg 1500 

ctcttggtgg cagagcgact ggaggggcca ttcaacattg agtcggtcat ggacccgata 1560 

gatgtcaaga tttctgaagc cattatgaac atgcaagaaa acagcatgca ggtgtctgca 1620 
aaggtctttc agggatgtgg tcagcccaaa cctgctccag ccctcagatc tgcccgctca- 1680 

gctcctgaaa attttaatac acgtttcagg ccctacaatc ctgaggaaag accaacaact 1740 

gctgcaggca caagcttgga ccggctggtc acagacataa aagagaaatt gaagctctct 1800 

aaaaaggtct ggtcagcatt accctacact atctgcaagg acgagagcgt gacagogggc 1860 

acgtccaacg aggaggaatg ctggaacggg cacagcaaag ccagatactt gcctgagatc 1920 

atgaatgatg ggctcaccaa ccagatcaat aatcccgagg tggatgtgga catcactcgg 1980 

cctgacactt tcatcagaca gcagattatg gctctccgtg tgatgaccaa caaactaaaa 2040 

aacgcctaca atggcaatga tgtcaatttc caggacacaa gtgatgaatc cagtggctca 2100 
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gggagtggca gtgggtgcat ggatgacgtg tgtcccacgg agtttgagtt tgtcaccaca 2160 

gaggcccccg cagtggatcc cgaccggaga gaggtggact cttctgcagc ccagcgtggc 2220 

cactccctgc tctcctggtc tctcacctgc attgtcctgg cactgcagag actgtgcaga 2280 

Uaatcttggg tttttggtca gatgaaactg cattttagct atctgaatgg ccaactcact 2340 



-tefcfefete-fet-a- 


-eae fegfefcgga 


-eaafcefgass*- 










agagagcagt 


aatgcaatct 


gcctcccttt 


ttgttttccc 


aaagagtacc 


gggtgccaga 


2460 


ctgaactgct 


tcctctttcc 


ttcagctatc 


tgtggggacc 


ttgtttattc 


tagagagaat 


2520 


tcttactcaa 


atttttcgta 


ccaggagatt 


ttcttacctt 


catttgcttt 


tatgctgcag 


2580 


aagtaaagga 


atctcacgtt 


gtgagggttt 


ttttttttct 


catttaaaat 


aaaaaaggaa 


2640 


gaaagaaaat 


aattttcctt 


gtaaaatcgg 


gccaaacccc 


aagacagcta 


cattttcaac 


2700 


aaaaaagcaa 


acagagaaaa 


ataaatgaac 


tttaacactg 


taagttcagc 


attgacagcc 


2760 



<210> 12 

<211> 1799 

<212> DNA 

<213> Homo sapiens 

<400> 12 



gccgtggggt 


ttaccgagct 


ggatttgtat 


gttgcaccat 


gccttcttgg 


atcggggctg 


60 


tgattcttcc 


cctcttgggg 


ctgctgctct 


ccctccccgc 


cggggcggat 


gtgaaggctc 


120 


ggagctgcgg 


agaggtccgc 


caggcg'cacg 


gtgccaaggg 


attcagcctg 


gcggacatcc 


180 


cctaccagga 


gatcgcaggg 


gaacacttaa 


gaatctgtcc 


tcaggaatat 


acatgctgca 


240 


ccacagaaat 


ggaagacaag 


ttaagccaac 


aaagcaaact 


cgaatttgaa 


aaccttgtgg 


300 


aagagacaag 


ccattttgtg 


cgcaccactt 


ttgtgtccag 


gcataagaaa 


tttgacgaat 


360 


ttttccgaga 


gctcctggag 


aatgcagaaa 


agtcactaaa 


tgatatgttt 


gtacggacct 


420 


atggcatgct 


gtacatgcag 


aattcagaag 


tcttccagga 


cctcttcaca 


gagctgaaaa 


480 


ggtactacac 


tgggggtaat 


gtgaatctgg 


aggaaatgct 


caatgacttt 


tgggctcggc 


540 


tcctggaaog 


gatgtttcag 


ctgataaacc 


ctcagtatca 


cttcagtgaa 


gactacctgg 


600 


aatgtgtgag 


caaatacact 


gaccagctca 


agccatttgg 


agacgtgccc 


cggaaactga 


660 


agattcaggt 


tacccgcgcc 


ttcattgctg 


ccaggacctt 


tgtccagggg 


ctgactgtgg 


720 


gcagagaagt 


tgcaaaccga 


gtttccaagg 


tcagcccaac 


cccagggtgt 


atccgtgccc 


780 


tcatgaagat 


gctgtactgc 


ccatactgtc 


gggggcttcc 


cactgtgagg 


ccctgcaaca 


840 


actactgtct 


caacgtcatg 


aagggctgct 


tggcaaatca 


ggctgacctc 


gacacagagt 


900 


ggaatctgtt 


tatagatgca 


atgctcttgg 


tggcagagcg 


actggagggg 


ccattcaaca 


960 
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ttgagtcggt catggacccg atagatgtca agatttctga agccattatg aacatgcaag 1020 

aaaacagcat gcaggtgtct gcaaaggtct ttcagggatg tggtcagccc aaacctgctc 1080 

cagccctcag atctgcccgc tcagctcctg aaaattttaa tacacgtttc aggccctaca 1140 

atcctgagga aagaccaaca actgctgcag gcacaagctt ggaccggctg gtcacagaca 1200 

taaaagagaa attgaagctc tctaaaaagg tctggtcagc attaccctac actatctgca ' 12 60 



aggacgagag cgtgacagcg ggcacgtcca acgaggagga atgctggaac gggcacagca 1320 

aagccagata cttgcctgag atcatgaatg atgggctcac caaccagatc aacaatcccg 1380 

aggtggatgt ggacatcact cggcctgaca ctttcatcag acagcagatt atggctctcc 1440 

gtgtgatgac caacaaacta aaaaacgcct acaatggcaa tgatgtcaat ttccaggaca 1500 

caagtgatga atccagtggc tcagggagtg gcagtgggtg catggatgac gtgtgtccca 1560 

cggagtttga gtttgtcacc acagaggccc ccgcagtgga tcccgaccgg agagaggtgg 1620 

actcttctgc agcccagcgt ggccactccc tgctctcctg gtctctcacc tgcattgtcc 1680 

tggcactgca gagactgtgc agataatctt gggtttttgg tcagatgaaa ctgcatttta 1740 

gctatctgaa tggccaactc acttcttttc ttacactctt ggacaatgga ccatgccac 1799 



<210> 13 

<211> 558 

<212> PRT 

<213> Homo sapiens 

<400> 13 

Met Glu Leu Arg Ala Arg Gly Trp Trp Leu Leu Cys Ala Ala Ala Ala 
15 10 15 



Leu Val Ala Cys Ala Arg Gly Asp Pro Ala Ser Lys Ser Arg Ser Cys 
20 25 30 



Gly Glu Val Arg Gin He Tyr Gly Ala Lys Gly Phe Ser Leu Ser Asp 
35 40 45 



Val Pro Gin Ala Glu He Ser Gly Glu His Leu Arg He Cys Pro Gin 
50 55 60 



Gly Tyr Thr Cys Cys Thr Ser Glu Met Glu Glu Asn Leu Ala Asn Arg 
65 70 75 80 



Ser His Ala Glu Leu Glu Thr Ala 'Leu Arg Asp Ser Ser Arg Val Leu 

B5 90 95 



Gin Ala Met Leu Ala Thr Gin Leu Arg Ser Phe Asp Asp His Phe Gin 
100 105 110 
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His Leu Leu Asrx Asp Ser Glu Arg Thr Leu Gin Ala Thr Phe Pro Gly 
115 120 125 



Ala Phe Gly Glu Leu Tyr Thr Gin Asn Ala Arg Ala Phe Arg Asp Leu 
130 135 140 



Tyr Ser Glu Leu Arg Leu Tyr Tyr Arg Gly Ala Asn Leu His Leu Glu 
145 150 155 160 



Glu Thr Leu Ala Glu Phe Trp Ala Arg Leu Leu Glu Arg Leu Phe Lys 
165 170 175 



Gin Leu His Pro Gin Leu Leu Leu Pro Asp Asp Tyr Leu Asp Cys Leu 
180 185 190 



Gly Lys Gin Ala Glu Ala Leu Arg Pro Phe Gly Glu Ala Pro Arg Glu 
195 200 205 



Leu Arg lieu Arg Ala Thr Arg Ala Phe Val Ala Ala Arg Ser Phe Val 
210 215 220 



Gin Gly Leu Gly Val Ala Ser Asp Val Val Arg Lys Val Ala Gin Val 
225 230 235 240 



Pro Leu Gly Pro Glu Cys Ser Arg Ala Val Met Lys Leu Val Tyr Cys 
245 250 255 



Ala His Cys Leu Gly Val Pro Gly Ala Arg Pro Cys Pro Asp Tyr Cys 
260 265 270 



Arg Asn Val Leu Lys Gly Cys Leu Ala Asn Gin Ala Asp Leu Asp Ala 
275 280 285 



Glu Trp Arg Asn Leu Leu Asp Ser Met Val Leu lie Thr Asp Lys Phe 
290 295 300 



Trp Gly Thr Ser Gly Val Glu Ser Val lie Gly Ser Val His Thr Trp 
305 310 315 320 



Leu Ala Glu Ala He Asn Ala Leu Gin Asp Asn Arg Asp Thr Leu Thr 
325 330 335 



Ala Lys Val lie Gin Gly Cys Gly Asn Pro Lys Val Asn Pro Gin Gly 
340 345 350 
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Pro Gly Pro Glu Glu Lys Arg Arg Arg Gly Lys Leu Ala Pro Arg Glu 
355 360 365 



Arg Pro Pro Ser Gly Thr Leu Glu Lys Leu Val Ser Glu Ala Lys Ala 
370 375 380 



Gin Leu Arg Asp Val Gin Asp Phe Trp He Ser Leu Pro Gly Thr Leu 

385 iS5 AGO 



Cys Ser Glu Lys Met Ala Leu Ser Thr Ala Ser Asp Asp Arg Cys Trp 
405 410 415 



Asn Gly Met Ala Arg Gly Arg Tyr Leu Pro Glu Val Met Gly Asp Gly 
420 425 430 



Leu Ala Asn Gin He Asn Asn Pro Glu Val Glu Val Asp He Thr Lys 
435 440 - 445 



Pro Asp Met Thr He Arg Gin Gin He Met Gin Leu Lys He Met Thr 
450 455 460 



Asn Arg Leu Arg Ser Ala Tyr Asn Gly Asn Asp Val Asp Phe Gin Asp 
465 470 475 480 



Ala Ser Asp Asp Gly Ser Gly Ser Gly Ser Gly Asp Gly Cys Leu Asp 
485 49C 495 



Asp Leu Cys Gly Arg Lys Val Ser Arg Lys Ser Ser Ser Ser Arg Thr 
500 505 510 



Pro Leu Thr His Ala Leu Pro Gly Leu Ser Glu Gin Glu Gly Gin Lys 
515 520 525 



Thr Ser Ala Ala Ser Cys Pro Gin Pro Pro Thr Phe Leu Leu Pro Leu 
530 535 540 



Leu Leu Phe Leu Ala Leu Thr Val Ala Arg Pro Arg Trp Arg 

555 



545 




550 


<210> 


14 




<211> 


579 




<212> 


PRT 




<213> 


Rattus 


norvegicus 


<400> 


14 





Met Ser Ala Val Arg Pro Leu Leu Leu Leu Leu Leu Pro Leu Cys Pro 
15 10 15 
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Gly Pro Gly Pro Gly His Gly Ser Glu Ala Lys Val Val Arg Ser Cys 
20 25 30 



Ala Glu Thr Arg Gin Val Leu Gly Ala Arg Gly Tyr Ser Leu Asn Leu 
35 40 45 



-.a^ja^^^py-e-^e^-jbea-I- le S e r Gly Glu His Leu Gin H e Cys -Er o G ln. 



50 55 60 



Glu Tyr Thr Cys Cys Ser Ser Glu Thr Glu Gin Lys Leu He Arg Asp 
65 70 75 80 



Ala Glu Val Thr Phe Arg Gly Leu Val Glu Asp Ser Gly Ser Phe Leu 
85 90 95 



He His Thr Leu Ala Ala Arg His Arg Lys Phe Asn Glu Phe Phe Arg 
100 105 110 



Glu Met Leu Ser He Ser Gin His Ser Leu Ala Gin Lou Phe Ser His 
115 120 125 



Ser Tyr Gly Arg Leu Tyr Ser Gin His Ala Val He Phe Asn Ser Leu 
130 135 140 



Phe Ser Gly Leu Arg Asp Tyr Tyr Glu Lys Ser Gly Glu Gly Leu Asp 
145 150 155 160 



Asp Thr Leu Ala Asp Phe Trp Ala Gin Leu Leu Glu Arg Ala Phe Pro 
165 170 175 



Leu Leu His Pro Gin Tyr Ser Phe Pro Pro Asp Phe Leu Leu Cys Leu 
180 185 190 



Thr Arg Leu Thr Ser Thr Ala Asp Gly Ser Leu Gin Pro Phe Gly Asp 
195 200 205 



Ser Pro Arg Arg Leu Arg Leu Gin He Thr Arg Ala Leu Val Ala Ala 
210 215 220 



Arg Ala Leu Val Gin Gly Leu Glu Thr Gly Arg Asn Val Val Ser Glu 
225 230 235 240 



Ala Leu Lys Val Pro Met Leu Glu Gly Cys Arg Gin Ala Leu Met Arg 
245 250 255 



Leu He Gly Cys Pro Leu Cys Arg Gly Val Pro Ser Leu Met Pro Cys 
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260 265 270 



Arg Gly Phe Cys Leu Asn Val Ala His Gly Cys Leu Ser Ser Arg Gly 
275 280 285 



Leu Glu Pro Glu Trp Gly Gly Tyr Leu Asp Gly Leu Leu Leu Leu Ala 
290 295 300 



Glu Lys Leu Gin Gly Pro Phe Ser Phe Glu Leu Ala Ala Glu Ser lie 
305 310 315 320 



Gly Val Lys He Ser Glu Gly Leu Met His Leu Gin Glu Asn Ser Val 

325 330 335 



Lys Val Ser Ala Lys Val Phe Gin Glu Cys Gly Thr Pro His Pro Val 
340 345 350 



Gin Ser Arg Asn Arg Arg Ala Pro Ala Pro Arg Glu Glu Thr Ser Arg 
355 360 365 



Ser Trp Arg Ser Ser Ala Glu Glu Glu Arg Pro Thr Thr Ala Ala Gly 
370 375 380 



Thr Asn Leu His Arg Leu Val Trp Glu Leu Arg Glu Arg Leu Ser Arg 
385 390 395 400 



Val Arg Gly Phe Trp Ala Gly Lou pro Val Thr Val Cys Gly Asp Ser 
405 410 415 



Arg Met Ala Ala Asp Leu Ser Gin Glu Ala Ala Pro Cys Trp Thr Gly 
420 425 430 



Val Gly Arg Gly Arg Tyr Met Ser Pro Val Val Val Gly Ser Leu Asn 
435 440 ' 445 



Glu Gin Leu His Asn Pro Glu Leu Asp Thr Ser Ser Pro Asp Val Pro 
450 455 460 



Thr Arg Arg Arg Arg Leu His Leu Arg Ala Ala Thr Ala Arg Met Lys 
465 470 475 480 



Ala Ala Ala Leu Gly Gin Asp Leu Asp Met His Asp Ala Asp Glu Asp 
485 490 495 



Ala Ser Gly Ser Gly Gly Gly Gin Gin Tyr Ala Asp Asp Trp Lys Ala 
500 505 510 
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Giy Ala Ala Pro Val Val Pro Pro Ala Arg Pro Pro Arg Pro Pro Arg 
515 520 525 



Pro Pro Arg Arg Asp Gly Leu Gly Val Arg Gly Gly Ser Gly Sor Ala 
530 535 540 



.Acs- .-lyx-Asxi-Glji Gly Arg Ser Rrg flsti TiRU Gly. Ser Ssr Val. .Gly .hen. 



545 550 555 560 



His Ala Pro Arg Val Phe lie Leu Leu Pro Ser Ala Leu Thr Leu Leu 
565 570 575 



Gly Leu Arg 



<210> 15 

<211> 579 

<212> PRT 

<213> Homo sapiens 

<400> 15 



Met Ser Ala Leu Arg Pro Leu Leu Leu Leu Leu Leu Pro Leu Cys Pro 
15 10 15 



Gly Pro Gly Pro Gly Pro Gly Ser Glu Ala Lys Val Thr Arg Ser Cys 
20 25 30 



Ala Glu Thr Arg Gin Val Leu Gly Ala Arg Gly Tyr Ser Leu Asn Leu 
35 40 45 



He Pro Pro Ala Leu He Ser Gly Glu His Leu Arg Val Cys Pro Gin 
50 55 60 



Glu Tyr Thr Cys Cys Ser Ser Glu Thr Glu Gin Arg Leu He Arg Glu 
65 70 75 80 



Thr Glu Ala Thr Phe Arg Gly Leu Val Giu Asp Ser Gly Ser Phe Leu 
85 90 95 



Val His Thr Leu Ala Ala Arg His Arg Lys Phe Asp Glu Phe Phe Leu 
100 105 110 



Glu Met Leu Ser Val Ala Gin His Ser Leu Thr Gin Leu Phe Ser His 
115 120 125 



Ser Tyr Gly Arg Leu Tyr Ala Gin His Ala Leu He Phe Asn Gly Leu 
130 135 140 
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Phe Ser Arg Leu Arg Asp Phe Tyr Gly Glu Ser Gly Glu Gly Lou Asp 
145 150 155 160 



Asp Thr Leu Ala Asp Phe Trp Ala Gin Leu Leu Glu Arg Val Phe Pro 
165 170 175 



Leu Leu His Pro Gin Tyr Ser Phe Pro Pro Asp Tyr Leu Leu Cys Leu 
180 185 190 



Ser Arg Leu Ala Ser Ser Thr Asp Gly Ser Leu Gin Pro Phe Gly Asp 
195 200 205 



Ser Pro Arg Arg Leu Arg Leu Gin lie Thr Arg Thr Leu Val Ala Ala 
210 215 220 



Arg Ala Phe Val Gin Gly Leu Glu Thr Gly Arg Asn Val Val Ser Glu 
225 230 235 240 



Ala Leu Lys Val Pro Val Ser Glu Gly Cys Ser Gin Ala Leu Met Arg 
245 250 255 



Leu lie Gly Cys Pro Leu Cys Arg Gly Val Pro Ser Leu Met Pro Cys 
260 265 270 



Gin Gly Phe Cys Leu Asn Val Val Arg Gly Cys Leu Ser Ser Arg Gly 
275 280 285 



Leu Glu Pro Asp Trp Gly Asn Tyr Leu Asp Gly Leu Leu He Leu Ala 
290 295 300 



Asp Lys Leu Gin Gly Pro Phe Ser Phe Glu Leu Thr Ala Glu Ser He 
305 310 315 320 



Gly Val Lys He Ser Glu Gly Leu Met Tyr Leu Gin Glu Asn Ser Ala 
325 330 335 



Lys Val Ser Ala Gin Val Phe Gin Glu Cys Gly Pro Pro Asp Pro Val 
340 345 350 



Pro Ala Arg Asn Arg Arg Ala Pro Pro Pro Arg Glu Glu Ala Gly Arg 
355 360 365 



Leu Trp Ser Met Val Thr Glu Glu Glu Arg Pro Thr Thr Ala Ala Gly 
370 375 380 
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Thr Asn Lou His Arg Leu Val Trp Glu Leu Arg Glu Arg Leu Ala Arg 
385 390 395 400 



Met Arg Gly Phe Trp Ala Arg Leu Ser Leu Thr Val Cys Gly Asp Ser 
405 410 415 



Arg Met Ala Ala Asp Ala Ser Leu Glu Ala Ala Pro Cys Trp Thr Gly 

. 42X1 425 43JD 



Ala Gly Arg Gly Arg Tyr Leu Pro Pro Val Val Gly Gly Ser Pro Ala 
435 440 445 



Glu Gin Val Asn Asn Pro Glu lieu Lys Val Asp Ala Ser Gly Pro Asp 
450 455 460 



Val Pro Thr Arg Arg Arg Arg Leu Gin Leu Arg Ala Ala Thr Ala Arg 
465 470 475 480 



Met Lys Thr Ala Ala Leu Gly His Asp Leu Asp Gly Gin Asp Ala Asp 
485 490 495 



Glu Asp Ala Ser Gly Ser Gly Gly Gly Gin Gin Tyr Ala Asp Asp Trp 
500 505 510 



Met Ala Gly Ala Val Ala Pro Pro Ala Arg Pro Pro Arg Pro Pro Tyr 
515 520 525 



Pro Pro Arg Arg Asp Gly Ser Gly Gly Lys Gly Gly Gly Gly Ser Ala 
530 535 540 



Arg Tyr Asn Gin Gly Arg Ser Arg Ser Gly Gly Ala Ser lie Gly Phe 
545 550 555 560 



His Thr Gin Thr lie Leu lie Leu Ser Leu Ser Ala Leu Ala Leu Leu 
565 570 575 



Gly Pro Arg 



<210> 16 

<211> 580 

<212> PRT 

<213> Homo sapiens 

<400> 16 

Met Ala Gly Thr Val Arg Thr Ala Cys Leu Val Vai Ala Met Leu Leu 
15 10 15 
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Ser lieu Asp Phe Pro Gly Gin Ala Gin Pro Pro Pro Pro Pro Pro Asp 
20 25 30 



Ala Thr Cys His Gin Val Arg Ser Phe Phe Gin Arg Leu Gin Pro Gly 
35 40 45 



-Leu- Ly-s— 5fct 
50 



55 



60 



Cys Leu Pro Lys Gly Pro Thr Cys Cys Ser Arg Lys Met Glu Glu Lys 
65 70 75 80 



Tyr Gin Leu Thr Ala Arg Leu Asn Met Glu Gin Leu Leu Gin Ser Ala 
85 90 95 



Ser Met Glu Leu Lys Phe Leu lie lie Gin Asn Ala Ala Val Phe Gin 
100 105 110 



Glu Ala Phe Glu lie Val Val Arg Kis Ala Lys Asn Tyr Thr Asn Ala 
115 120 125 



Met Phe Lys Asn Asn Tyr Pro Ser Leu Thr Pro Gin Ala Phe Glu Phe 
130 135 140 



Val Gly Glu Phe Phe Thr Asp Val Ser Leu Tyr He Leu Gly Ser Asp 
145 150 155 160 



He Asn Val Asp Asp Met Val Asn Glu Leu Phe Asp Ser Leu Phe Pro 
165 170 175 



Val He Tyr Thr Gin Leu Met Asn Pro Gly Leu Pro Asp Ser Ala Leu 
180 185 190 



Asp He Asn Glu Cys Leu Arg Gly Ala Arg Arg Asp Leu Lys Val Phe 
195 200 205 



Gly Asn Phe Pro Lys Leu He Met Thr Gin Val Ser Lys Ser Leu Gin 
210 215 220 



Val Thr Arg He Phe Leu Gin Ala Leu Asn Leu Gly lie Glu Val He 
225 230 235 240 



Asn Thr Thr Asp His Leu Lys Phe Ser Lys Asp Cys Gly Arg Met Leu 
245 250 255 



Thr Arg Met Trp Tyr Cys Ser Tyr Cys Gin Gly Leu Met Met Val Lys 
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260 265 270 



Pro Cys Gly Gly Tyr Cys Asn Val Val Met Gin Gly Cys Met Ala Gly 
275 280 285 



Val Val Glu He Asp Lys Tyr Trp Arg Glu Tyr He Leu Ser Leu Glu 
290 295 300 



Glu Leu Val Asn Gly Met Tyr Arg He Tyr Asp Met Glu Asn Val Leu 
305 ■ 310 315 320 



Leu Gly Leu Phe Ser Thr He His Asp Ser He Gin Tyr Val Gin Lys 
325 330 335 



Asn Ala Gly Lys Leu Thr Thr Thr He Gly Lys Leu Cys Ala His Ser 
340 345 350 



Gin Gin Arg Gin Tyr Arg Ser Ala Tyr Tyr Pro Glu Asp Leu Phe He 

355 360 365 



Asp Lys Lys Val Lou Lys Val Ala His Val Glu His Glu Glu Thr Leu 

370 375 380 



Ser Ser Arg Arg Arg Glu Leu He Gin Lys Leu Lys Ser Phe He Ser 
385 390 395 400 



Phe Tyr Ser Ala Leu Pro Gly Tyr He Cys Ser His Ser Pro Val Ala 
405 410 415 



Glu Asn Asp Thr Leu Cys Trp Asn Gly Gin Glu Leu Val Glu Arg Tyr 
420 425 430 



Ser Gin Lys Ala Ala Arg Asn Gly Met Lys Asn Gin Phe Asn Leu His 
435 440 445 



Glu Leu Lys Met Lys Gly Pro Glu Pro Val Val Ser Gin He He Asp 
450 455 460 



Lys Leu Lys His He Asn Gin Leu Leu Arg Thr Met Ser Met Pro Lys 
465 470 475 480 



Gly Arg Val Leu Asp Lys Asn Leu Asp Glu Glu Gly Phe Glu Ser Gly 

485 490 495 



Asp Cys Gly Asp Asp Glu Asp Glu Cys He Gly Gly Ser Gly Asp Gly 
500 505 510 
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Met He Ijys Val Lys Asn Gin Leu Arg Phe Leu Ala Glu Leu Ala Tyi 
515 520 525 



Asp Leu Asp Val Asp Asp Ala Pro Gly Asn Ser Gin Gin Ala Thr, Pro 
530 535 540 



-fiys "As 
545 



550 



Lou Gly A p n Val -4fts g ar 
555 560 



Pro Leu Lys Leu Leu Thr Ser Met Ala He Ser Val Val Cys Phe Phe 
565 570 575 



Phe Leu Val His 
580 



<210> 17 

<211> 580 

<212> PRT 

<213> Homo sapiens 

<400> 17 

Met Ala Gly Thr Val Arg Thr Ala Cys Leu Val Val Ala Met Leu Leu 
15 10 15 



Ser Leu Asp Phe Pro Gly Gin Ala Gin Pro Pro Pro Pro Pro Pro Asp 

20 25 30 



Ala Thr Cys His Gin Val Arg Ser Phe Phe Gin Arg Leu Gin Pro Gly 
35 40 45 



Leu Lys Trp Val Pro Glu Thr Pro Val Pro Gly Ser Asp Leu Gin Val 
50 55 60 



Cys Leu Pro Lys Gly Pro Thr Cys Cys Ser Arg Lys Met Glu Glu Lys 
65 70 75 80 



Tyr Gin Leu Thr Ala Arg Leu Asn Met Glu Gin Leu Leu Gin Ser Ala 
85 90 95 



Ser Met Glu Leu Lys Phe Leu He He Gin Asn Ala Ala Val Phe Gin 
100 105 110 



Glu Ala Phe Glu He Val Val Arg His Ala Lys Asn Tyr Thr Asn Ala 
115 120 125 



Met Phe Lys Asn Asn Tyr Pro Ser Leu Thr Pro Gin Ala Phe Glu phe 
130 135 140 
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Val Gly Glu Phe Phe Thr Asp Val Ser Leu Tyr lie Leu Gly Ser Asp 
145 150 155 160 



He Asn Val Asp Asp Met Val Asn Glu Leu Phe Asp Ser Leu Phe Pro 
165 170 175 



Val He Tyr Thr Gin Leu Met Asn Pro Gly Leu Pro Asp Ser Ala Leu 
180 185 190 



Asp He Asn Glu Cys Leu Arg Gly Ala Arg Arg Asp Leu Lys Val Phe 
195 200 205 



Gly Asn Phe Pro Lys Leu He Met Thr Gin Val Ser Lys Ser Leu Gin 
210 215 220 



Val Thr Arg He Phe Leu Gin Ala Leu Asn Leu Gly He Glu Val He 
225 230 235 240 



Asn Thr Thr Asp His Leu Lys Phe Ser Lys Asp Cys Gly Arg Met Leu 
245 250 255 



Thr Arg Met Trp Tyr Cys Ser Tyr Cys Gin Gly Leu Met Met Val Lys 
260 265 270 



Pro Cys Gly Gly Tyr Cys Asn Val Val Met Gin Gly Cys Met Ala Gly 
275 280 285 



Val Val Glu He Asp Lys Tyr Trp Arg Glu Tyr He Leu Ser Leu Glu 
290 295 300 



Glu Leu Val Asn Gly Met Tyr Arg He Tyr Asp Met Glu Asn Val Leu 
305 310 315 320 



Leu Gly Leu Phe Ser Thr He His Asp Ser He Gin Tyr Val Gin Lys 
325 330 335 



Asn Ala Gly Lys Leu Thr Thr Thr He Gly Lys Leu Cys Ala His Ser 
340 345 350 



Gin Gin Arg Gin Tyr Arg Ser Ala Tyr Tyr Pro Glu Asp Leu Phe He 
355 360 365 



Asp Lys Lys Val Leu Lys Val Ala His Val Glu His Glu Glu Thr Leu 
370 375 380 
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Ser Ser Arg Arg Arg Glu Leu lie Gin Lys Leu Lys Ser Phe lie Ser 
385 390 395 400 



Phe Tyr Ser Ala Leu Pro Gly Tyr He Cys Ser His Ser Pro Val Ala 
405 410 415 



Glu Asn Asp Thr Leu Cys Trp Asn Gly Gin Glu Leu Val Glu Arg Tyr 

. 420 425 430 



Ser Gin Lys Ala Ala Arg Asn Gly Met Lys Asn Gin Phe Asn Leu His 
435 440 445 



Glu Leu Lys Met Lys Gly Pro Glu Pro Val Val Ser Gin lie He Asp 
450 455 460 



Lys Leu Lys His He Asn Gin Leu Leu Arg Thr Met Ser Met Pro Lys 
465 470 475 480 



Gly Arg Val Leu Asp Lys Asn Leu Asp Glu Glu Gly Phe Glu Ser Gly 
485 490 495 



Asp Cys Gly Asp Asp Glu Asp Glu Cys He Gly Gly Ser Gly Asp Gly 
500 505 510 



Met He Lys Val Lys Asn Gin Leu Arg Phe Leu Ala Glu Leu Ala Tyr 
515 520 525 



Asp Leu Asp Val Asp Asp Ala Pro Gly Asn Ser Gin Gin Ala Thr Pro 
530 535 540 



Lys Asp Asn Glu He Ser Thr Phe His Asn Leu Gly Asn- Val His Ser 
545 550 555 560 



Pro Leu Lys Leu Leu Thr Ser Met Ala He Ser Val Val Cys Phe Phe 
565 570 575 



Phe Leu Val His 
580 



<210> 18 

<211> 556 

<212> PRT 

<213> Homo sapiens 

<400> 18 

Met Ala Arg Phe Gly Leu Pro Ala Leu Leu Cys Thr Leu Ala Val Leu 
15 10 15 



30/44 



WO 03/006611 



PCT/US02/21694 



Ser Ala Ala Leu Leu Ala Ala Giu Leu Lys Ser Lys Ser Cys Ser Glu 
20 25 30 



Val Arg Arg Leu Tyr Val Ser Lys Gly Phe Asn Lys Asn Asp Ala Pro 
35 40 45 



..Lia^..Ma...Glii..:U^_Asi^ He Cys Pro Gin Gly Sor 



50 55 60 



Thr Cys Cys Ser Gin Glu Met Glu Glu Lys Tyr Ser Leu Gin Ser Lys 
65 70 75 80 



Asp Asp Phe Lys Ser Val Val Ser Glu Gin Cys Asn His Leu Gin Ala 
85 90 95 



Val Phe Ala Ser Arg Tyr Lys Lys Phe Asp Glu Phe Phe Lys Glu Leu 
100 105 110 



Leu Glu Asn Ala Glu Lys Ser Lou Asn Asp Met Phe Val Lys Thr Tyr 
115 120 125 



Gly His Leu Tyr Met Gin Asn Ser Glu Leu Phe Lys Asp Leu Phe Val 
130 135 140 



Glu Leu Lys Arg Tyr Tyr Val Val Gly Asn Val Asn Leu Glu Glu Met 
145 150 155 160 



Leu Asn Asp Phe Trp Ala Arg Leu Leu Glu Arg Met Phe Arg Leu Val 
165 170 175 



Asn Ser Gin Tyr His Phe Thr Asp Glu Tyr Leu Glu Cys Val Ser Lys 
180 185 190 



Tyr Thr Glu Gin Leu Lys Pro Phe Gly Asp Val Pro Arg Lys Leu Lys 
195 200 205 



Leu Gin Val Thr Arg Ala Phe Val Ala Ala Arg Thr Phe Ala Gin Gly 
210 215 220 



Leu Ala Val Ala Gly Asp Val Val Ser Lys Val Ser Val Val Asn Pro 
225 230 235 240 



Thr Ala Gin Cys Thr His Ala Leu Leu Lys Met lie Tyr Cys Ser His 
245 250 255 



Cys Arg Gly Leu Val Thr Val Lys Pro Cys Tyr Asn Tyr Cys Ser Asn 
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260 265 270 

lie Met Arg Gly Cys Leu Ala Asn Gin Gly Asp Leu Asp Phe Glu Trp 
275 280 285 

Asn Asn Phe lie Asp Ala Met Leu Met Val Ala Glu Arg Leu Glu Gly 
290 295 300 



Pro Phe Asn lie Glu Ser Val Met Asp Pro lie Asp Val Lys lie Ser 
305 310 315 320 



Asp Ala lie Met Asn Met Gin Asp Asn Ser Val Gin Val Ser Gin Lys 
325 330 335 



Val Phe Gin Gly Cys Gly Pro Pro Lys Pro Leu Pro Ala Gly Arg He 
340 345 350 



Ser Arg Ser He Ser Glu Ser Ala Phe Ser Ala Arg Phe Arg Pro His 
355 360 365 



His Pro Glu Glu Arg Pro Thr Thr Ala Ala Gly Thr Ser Leu Asp Arg 
370 375 380 



Leu Val Thr Asp Val Lys Glu Lys Leu Lys Gin Ala Lys Lys Phe Trp 
385 390 395 400 



Ser Ser Leu Pro Ser Asn Val Cys Asn Asp Glu Arg Met Ala Ala Gly 
405 410 415 



Asn Gly Asn Glu Asp Asp Cys Trp Asn Gly Lys Gly Lys Ser Arg Tyr 
420 425 430 



Leu Phe Ala Val Thr Gly Asn Gly Leu Ala Asn Gin Gly Asn Asn Pro 
435 440 445 



Glu Val Gin Val Asp Thr Ser Lys Pro Asp He Leu He Leu Arg Gin 
450 455 460 



He Met Ala Leu Arg Val Met Thr Ser Lys Met Lys Asn Ala Tyr Asn 
465 470 475 480 



Gly Asn Asp Val Asp Phe Phe Asp He Ser Asp Glu Ser Ser Gly Glu 
485 490 495 



Gly Ser Gly Ser Gly Cys Glu Tyr Gin Gin Cys Pro Ser Glu Phe Asp 
500 505 510 
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Tyr Asn Ala Thr Asp His Ala Gly Lys Ser Ala Asn Glu Lys Ala Asp 
515 520 525 



Ser Ala Gly Val Arg Pro Gly Ala Gin Ala Tyr Leu Leu Thr Val Phe 
530 535 540 



-Cys lie ....Leu Ehs Leu Val Met Gin Ara Glu Trp Arg 



545 550 555 



<210> 19 

<211> 556 

<212> PRT 

<213> Homo sapiens 

<400> 19 

Met Ala Arg Phe Gly Lou Pro Ala Leu Leu Cys Thr Leu Ala Val Leu 

1-5 10 15 



Ser Ala Ala Leu Leu Ala Ala Glu Leu Lys Ser Lys Ser Cys Ser Glu 
20 25 30 



Val Arg Arg Leu Tyr Val Ser Lys Gly Phe Asn Lys Asn Asp Ala Pro 
35 40 . 45 



Leu His Glu He Asn Gly Asp His Leu Lys He Cys Pro Gin Gly Ser 
50 55 60 



Thr Cys pys Ser Gin Glu Met Glu Glu Lys Tyr Ser Leu Gin Ser Lys 
65 70 75 80 



Asp Asp Phe Lys Ser Val Val Ser Glu Gin Cys Asn His Leu Gin Ala 
85 90 95 



Val Phe Ala Ser Arg Tyr Lys Lys Phe Asp Glu Phe Phe Lys Glu Leu 
100 105 110 



Leu Glu Asa Ala Glu Lys Ser Leu Asn Asp Met Phe Val Lys Thr Tyr 
115 120 125 



Gly His Leu Tyr Met Gin Asn Ser Glu Leu Phe Lys Asp Leu Phe Val 
130 135 140 



Glu Leu Lys Arg Tyr Tyr Val Val Gly Asn Val Asn Leu Glu Glu Met 
145 150 155 160 



Leu Asn Asp Phe Trp Ala Arg Leu Leu Glu Arg Met Phe Arg Leu Val 
165 170 175 
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Asn Ser Gin Tyr His Phe Thr Asp Glu Tyr Leu Glu Cys Val Ser Lys 
180 185 190 



Tyr Thr Glu Gin Leu Lys Pro Phe Gly Asp Val Pro Arg Lys Leu Lys 
195 200 205 



Leu Gin Val Thr Arg Ala Phe Val Ala Ala Arg Thr Phe Ala Gin Gly 
210 215 220 



Leu Ala Val Ala Gly Asp Val Val Ser Lys Val Ser Val Val Asn Pro 
225 230 235 240 



Thr Ala Gin Cys Thr His Ala Leu Leu Lys Met He Tyr Cys Ser His 
245 250 255 



Cys Arg Gly Leu Val Thr Val Lys Pro Cys Tyr Asn Tyr Cys Ser Asn 
260 265 270 



He Met Arg Gly Cys Leu Ala Asn Gin Gly Asp Leu Asp Phe Glu Trp 
275 280 285 



Asn Asn Phe He Asp Ala Met Leu Met Val Ala Glu Arg Leu Glu Gly 
290 295 300 



Pro Phe Asn He Glu Ser Val Met Asp Pro He Asp Val Lys He Ser 
305 310 315 320 



Asp Ala He Met Asn Met Gin Asp Asn Ser Val Gin Val Ser Gin Lys 
325 330 335 



Val Phe Gin Gly Cys Gly Pro Pro Lys Pro Leu Pro Ala Gly Arg lie 
340 345 350 



Ser Arg Ser He Ser Glu Ser Ala Phe Ser Ala Arg Phe Arg Pro His 
355 360 365 



His Pro Glu Glu Arg Pro Thr Thr Ala Ala Gly Thr Ser Leu Asp Arg 
370 375 380 



Leu Val Thr Ash Val Lys Asp Lys Leu Lys Gin Ala Lys Lys Phe Trp 
385 390 395 400 



Ser Ser Leu Pro Ser Asn Val Cys Asn Asp Glu Arg Met Ala Ala Gly 
405 410 415 
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Asn Gly Asn Glu Asp Asp Cys Trp Asn Gly Lys Gly Lys Ser Arg Tyr 
420 425 430 



Leu Phe Ala Vai Thr Gly Asn Gly Leu Val Asn Gin Gly Asn Asn Pro 
435 440 445 



Glu Val Gin Val Asp Thr Ser Lys Pro Asp lie Leu lie Leu Arg Gin 

450 455 460 



lie Met Ala Leu Arg Val Met Thr Ser Lys Met Lys Asn Ala Tyr Asn 
465 470 475 480 



Gly Asn Asp Val Asp Phe Phe Asp lie Ser Asp Glu Ser Ser Gly Glu 
485 490 495 



Gly Ser Gly Ser Gly Cys Glu Tyr Gin Gin Cys Pro Ser Glu Phe Asp 
500 505 ' 510 



Tyr Asn Ala Thr Asp His Ala Gly Lys Ser Ala Asn Glu Lys Ala Asp 

520 525 



535 540 



550 555 





515 


Ser Ala 


Gly 


530 




Cys lie 


Leu 


545 




<210> 


20 


<211> 


572 


<212> 


PRT 


<213> 


Homo 


<400> 


20 



Met Asp Ala Gin Thr Trp Pro Val Gly Phe Arg Cys Leu Leu Leu Leu 

1 5 10 15 



Ala Leu Vai Gly Ser Ala Arg Ser Glu Gly Val Gin Thr Cys Glu Glu 
20 25 30 



Val Arg Lys Leu Phe Gin Trp Arg Leu Leu Gly Ala Val Arg Gly Leu 
35 40 45 



Pro Asp Ser Pro Arg Ala Gly Pro Asp Leu Gin Val Cys lie Ser Lys 
50 55 60 



Lys Pro Thr Cys Cys Thr Arg Lys Met Glu Glu Arg Tyr Gin He Ala 
65 70 75 80 
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Ala Arg Gin Asp Met Gin Gin Phe Leu Gin Tlir Ser Ser Ser Thr Leu 
85 90 95 



Lys Phe Leu He Ser Arg Asn Ala Ala Ala Phe Gin Glu Thr Leu Glu 
100 105 110 



Thr Leu lie Lys Gin Ala Glu Asn Ty r Thr Ser He Leu Phe Cys Ser 



115 120 125 



Thr Tyr Arg Asn Met Ala Leu Glu Ala Ala Ala Ser Val Gin Glu Phe 
130 135 140 



Phe Thr Asp Val Gly Leu Tyr Leu Phe Gly Ala Asp Val Asn Pro Glu 
145 150 155 160 



Glu Phe Val Asn Arg Phe Phe Asp Ser Leu Phe Pro Leu Val Tyr Asn 
165 170 175 



His Leu He Asn Pro Gly Val Thr Asp Ser Ser Leu Glu Tyr Ser Glu 
180 185 130 



Cys He Arg Met Ala Arg Arg Asp Val Ser Pro Pho Gly Asn He Pro 
195 200 205 



Gin Arg Val Met Gly Gin Met Gly Arg Ser Leu Leu Pro Ser Arg Thr 
210 215 220 



Phe Leu Gin Ala Leu Asn Leu Gly He Glu Val He Asn Thr Thr Asp 
225 230 235 240 



Tyr Leu His Phe Ser Lys Glu Cys Ser Arg Ala Leu Leu Lys Met Gin 
245 250 255 



Tyr Cys Pro His Cys Gin Gly Leu Ala Leu Thr Lys Pro Cys Met Gly 
260 265 270 



Tyr Cys Leu Asn Val Met Arg Gly Cys Leu Ala His Met Ala Glu Leu 
275 280 285 



Asn Pro His Trp His Ala Tyr lie Arg Ser Leu Glu Glu Leu Ser Asp 
290 295 300 



Ala Met His Gly Thr Tyr Asp He Gly His Val Leu Leu Asn Phe His 
305 310 315 320 



Leu Leu Val Asn Asp Ala Val Leu Gin Ala His Leu Asn Gly Gin Lys 



36/44 



WO 03/006611 



PCT/US02/21694 



325 330 335 



Leu Leu Glu Gin Val Asn Arg lie Cys Gly Arg Pro Val Arg Thr Pro 
340 345 350 



Thr Gin Ser Pro Arg Cys Ser Phe Asp Gin Ser Lys Glu Lys His Gly 

355 360 365 



Met Lys Thr Thr Thr Arg Asn Ser Glu Glu Thr Leu Ala Asn Arg Arg 
370 375 380 



Lys Glu Phe He Asn Ser Leu Arg Leu Tyr Arg Ser Phe Tyr Gly Gly 
385 390 395 400 



Leu Ala Asp Gin Leu Cys Ala Asn Glu Leu Ala Ala Ala Asp Gly Leu 
405 410 415 



Pro Cys Trp Asn Gly Glu Asp lie Val Lys Ser Tyr Thr Gin Arg Val 
420 425 430 



Val Gly Asn Gly He Lys Ala Gin Ser Gly Asn Pro Glu Val Lys Val 
435 440 445 



Lys Gly He Asp Pro Val He Asn Gin He He Asp Lys Leu Lys His 
450 455 460 



Val Val Gin Leu Lou Gin Gly Arg Ser Pro Lys Pro Asp Lys Trp Glu 
465 470 475 480 



Leu Leu Gin Leu Gly Ser Gly Gly Gly Met Val Glu Gin Val Ser Gly 
485 490 495 



Asp Cys Asp Asp Glu Asp Gly Cys Gly Gly Ser Gly Ser Gly Glu Val 
500 505 510 



Lys Arg Thr Leu Lys He Thr Asp Trp Met Pro Asp Asp Met Asn Phe 
515 520 525 



Ser Asp Val Lys Gin He His Gin Thr Asp Thr Gly Ser Thr Leu Asp 
530 535 540 



Thr Thr Gly Ala Gly Cys Ala Val Ala Thr Glu Ser Met Thr Phe Thr 
545 550 555 560 



Leu He Ser Val Val Met Leu Leu Pro Gly He Trp 
565 570 
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<210> 21 

<211> 555 

<212> PRT 

<213> Homo sapiens 

<400> 21 

Met Pro Ser Trp He Gly Ala Val He Leu Pro Leu Leu Gly Leu Leu 

...i 10 3b 



Leu Ser Leu Pro Ala Glv Ala Asp Val Lys Ala Arg Ser Cys Gly Glu 
20 25 30 



Val Arg Gin Ala Tyr Gly Ala Lys Gly Phe Ser Leu Ala Asp He Pro 
35 40 45 



Tyr Gin Glu He Ala Gly Glu His Leu Arg He Cys Pro Gin Glu Tyr 
50 55 60 

Thr Cys Cys Thr Thr Glu Met Glu Asp Lys Leu Ser Gin Gin Ser Lys 

65 70 75 80 



Leu Glu Phe Glu Asn Leu Val Glu Glu Thr Ser His Phe Val Arg Thr 
85 90 95 



Thr Phe Val Ser Arg His Lys Lys Phe Asp Glu Phe Phe Arg Glu Leu 
100 105 • HO 



Leu Glu Asn Ala Glu Lys Ser Leu Asn Asp Met' Phe Val Arg Thr Tyr 
115 120 125 



Gly Met Leu Tyr Met Gin Asn Ser Glu Val Phe Gin Asp Leu Phe Thr 
130 135 140 

Glu Leu Lys Arg Tyr Tyr Thr Gly Gly Asn Val Asn Leu Glu Glu Met 
145 150 155 ISO 

Leu Asn Asp Phe Trp Ala Arg Leu Leu Glu Arg Met Phe Gin Leu He 
165 170 175 



Asn Pro Gin Tyr His Phe Ser Glu Asp Tyr Leu Glu Cys Val Ser Lys 
180 185 190 



Tyr Thr Asp Gin Leu Lys Pro Phe Gly Asp Val Pro Arg Lys Leu Lys 
195 200 205 



Tie Gin Val Thr Arg Ala Phe He Ala Ala Arg Thr Phe Val Gin Gly 
210 215 220 
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Leu Thr Val Gly Arg Glu Val Ala Asn Arg Val Ser Lys Val Ser Pro 

225 230 235 240 



Thr Pro Gly Cys lie Arg Ala Leu Met Lys Met Leu Tyr Cys Pro Tyr 
245 250 255 



Cys Arg Gly Leu Pro Thr Val Arg Pro Cys Asn Asn Tyr Cys Leu Asn 
260 265 270 



Val Met Lys Gly Cys Leu Ala Asn Gin Ala Asp Leu Asp Thr Glu Trp 
275 280 285 



Asn Leu Phe He Asp Ala Met Leu Leu Val Ala Glu Arg Leu Glu Gly 
290 295 300 



Pro Phe Asn He Glu Ser Val Met Asp Pro He Asp Val Lys He Ser 
305 310 315 320 



Glu Ala He Mot Asn Met Gin Glu Asn Ser Met Gin Val Ser Ala Lys 
325 330 335 



Val Phe Gin Gly Cys Gly Gin Pro Lys Pro Ala Pro Ala Leu Arg Ser 
340 345 350 



Ala Arg Ser Ala Pro Glu Asn Phe Asn Thr Arg Phe Arg Pro Tyr Asn 
355 360 365 



Pro Glu Glu Arg Pro Thr Thr Ala Ala Gly Thr Ser Leu Asp Arg Leu 
370 375 380 



Val Thr Asp He Lys Glu Lys Leu Lys Leu Ser Lys Lys Val Trp Ser 
385 390 395 400 



Ala Leu Pro Tyr Thr He Cys Lys Asp Glu Ser Val Thr Ala Gly Thr 
405 410 415 



Ser Asn Glu Glu Glu Cys Trp Asn Gly His Ser Lys Ala Arg Tyr Leu 
420 425 430 



Pro Glu He Met Asn Asp Gly Leu Thr Asn Gin He Asn Asn Pro Glu 
435 440 445 



Val Asp Val Asp He Thr Arg Pro Asp Thr Phe He Arg Gin Gin He 
450 455 460 
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Met Ala Leu Arg Val Met Thr Asn Lys Leu Lys Asn Ala Tyr Asn Gly 
465 470 475 480 



Asn Asp Val Asn Phe Gin Asp Thr Ser Asp Glu Ser Ser Gly Ser Gly 
485 490 495 



Ser Gly Ser Gly Cys Met Asp Asp Val Cys Pro Thr Glu Phe Glu Phe 
500 505 510 



Val Thr Thr Glu Ala Pro Ala Val Asp Pro Asp Arg Arg Glu Val Asp 
515 520 525 



Ser Ser Ala Ala Gin Arg Gly His Ser Leu Leu Ser Trp Ser Leu Thr 
530 535 540 



Cys He Val Leu Ala Leu Gin Arg Leu Cys Arg 
545 550 555 



<210> 22 

<211> 555 

<212> PRT 

<213> Homo sapiens 

<400> 22 

Met Pro Ser Trp He Gly Ala Val He Leu Pro Leu Leu Gly Leu Leu 
15 10 15 



Leu Ser Leu Pro Ala Gly Ala Asp Val Lys Ala Arg Ser Cys Gly Glu 
20 25 30 



Val Arg Gin Ala Tyr Gly Ala Lys Gly Phe Ser Leu Ala Asp He Pro 
35 40 45 



Tyr Gin Glu He Ala Gly Glu His Leu Arg He Cys Pro Gin Glu Tyr 
50 55 60 



Thr Cys Cys Thr Thr Glu Met Glu Asp Lys Leu Ser Gin Gin Ser Lys 
65 70 75 80 



Leu Glu Phe Glu Asn Leu Val Glu Glu Thr Ser His Phe Val Arg Thr 
85 90 95 



Thr Phe Val Ser Arg His Lys Lys Phe Asp Glu Phe Phe Arg Glu Leu 
100 105 110 



Leu Glu Asn Ala Glu Lys Ser Leu Asn Asp Met Phe Val Arg Thr Tyr 
115 120 125 
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Gly Met Leu Tyr Met Gin Asn Ser Glu Val Phe Gin Asp Leu Phe Thr 
130 135 140 



Glu Leu Lys Arg Tyr Tyr Thr Gly Gly Asn Val Asn Leu Glu Glu Met 
145 ' 150 155 160 



hBU A-sn.- Asp -P-fee- '!'rp 

165 



170 



175 



Asn Pro Gin Tyr His Phe Ser Glu Asp Tyr Leu Glu Cys Val Ser Lys 
180 185 190 



Tyr Thr- Asp Gin Leu Lys Pre Phe Gly Asp Val Pro Arg Lys Leu Lys 
195 200 205 



He Gin Val Thr Arg Ala Phe He Ala Ala Arg Thr Phe Val Gin Gly 
210 215 220 



Leu Thr Val Gly Arg Glu Val Ala Asn Arg Val Ser Lys Val Ser Pro 
225 230 235 240 



Thr Pro Gly Cys He Arg Ala Leu Met Lys Met Leu Tyr Cys Pro Tyr 
245 250 255 



Cys Arg Gly Leu Pro Thr Val Arg Pro Cys Asn Asn Tyr Cys Leu Asn 
260 265 270 



Val Met Lys Gly Cys Leu Ala Asn Gin Ala Asp Leu Asp Thr Glu Trp 
275 280 285 



Asn Leu Phe He Asp Ala Met Leu Leu Val Ala Glu Arg Leu Glu Gly 

290 295 300 



Pro Phe Asn lie Glu Ser Val Met Asp Pro He Asp Val Lys He Ser 
305 310 315 320 



Glu Ala He Met Asn Met Gin Glu Asn Ser Met Gin Val Ser Ala Lys 
325 330 335 



Val Phe Gin Gly Cys Gly Gin Pro Lys Pro Ala Pro Ala Leu Arg Ser 
340 345 350 



Ala Arg Ser Ala Pro Glu Asn Phe Asn Thr Arg Phe Arg Pro Tyr Asn. 
355 360 365 



Pro Glu Glu Arg Pro Thr Thr Ala Ala Gly Thr Ser Leu Asp Arg Leu 
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370 375 380 



Val Thr Asp He Lys Glu Lys Leu Lys Leu Ser Lys Lys Val Trp Ser 
385 " 390 395 400 



Ala Leu Pro Tyr Thr He Cys Lys Asp Glu Ser Val Thr Ala Gly Thr 
405 410 415 



Ser Asn Glu Glu Glu Cys Trp Asn Gly His Ser Lys Ala Arg Tyr Leu 
420 425 430 



Pro Glu He Met Asn Asp Gly Leu Thr Asn Gin He Asn Asn Pro Glu 
435 440 445 



Val Asp Val Asp He Thr Arg Pro Asp Thr Phe He Arg Gin Gin He 
450 455 460 



Met Ala Leu Arg Val Mot Thr Asn Lys Leu Lys Asn Ala Tyr Asn Gly 
465 470 475 480 



Asn Asp Val Asn Phe Gin Asp Thr Ser Asp Glu Ser Ser Gly Ser Gly 
485 490 495 



Ser Gly Ser Gly Cys Met Asp Asp Val Cys Pro Thr Glu Phe Glu Phe 
500 505 510 



Val Thr Thr Glu Ala Pro Ala Val Asp Pro Asp Arg Arg Glu Val Asp 
515 520 525 



Ser Ser Ala Ala Gin Arg Gly His Ser Leu Leu Ser Trp Ser Leu Thr 
530 535 540 



Cys He Val Leu Ala Lou Gin Arg Leu Cys Arg 
545 550 555 



<210> 23 

<211> 450 

<212> DNA 

<213> Homo sapiens 

<400> 23 



tttttttctt 


tctttccttc 


cttctttcct 


tgtttctttc 


tctgtctctc 


tctctctctt 


60 


tctttctctc 


tctctctttc 


tttcttgaga 


tgacgtctag 


ctatgttgcc 


ccagctgggc 


120 


tccaactcct 


ggcctcaagt 


gatcctcctg 


cctcagcctc 


tcaaagtgct 


gtgattacag 


180 


gtgtgagcca 


ccatgcctgg 


ctccaggctt 


tcttgaatcc 


cctcacccag 


atagttgccc 


240 


cagtcaggct 


ccagtcccct 


gctgctgaga 


cagccacgaa 


ccacgttgag 


gcagaagccc 


300 
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tggcagggca taagtgaggg gaccccccgg cacaggggac agccgatgag acgcatcaga 3 60 
gcctggctgc agccttcaga caccggoacc tgggggcaga gagtggggct gtgtcctcca 420 
caaacgctgt ggctgctctc agcctggctt 450 

<210> 24 

-^■3"3:"3r'*^ 3"'3i~3r'& 1 1 ~~ — 

<212> DNA 

<213> Homo sapiens 

<400> 24 



gcgcccaggt 


agctgcgagg 


aaacttttgc 


agcggctggg 


tagcagcacg 


tctcttgctc 


60 


ctcagggcca 


ctgccaggct 


tgccgagtcc 


tgggactgct 


ctcgctccgg 


ctgccactct 


12 0 


cccgcgctct 


cctagctccc 


tgcgaagcag 


gatggccggg 


accgtgcgca 


ccgcgtgctt 


180 


ggtggtggcg 


atgctgctca 


gcttggactt 


cccgggacag 


gcgcagcccc 


cgccgccgcc 


240 


gccggacgcc 


acctgtcacc 


aagtccgctc 


cttcttccag 


agactgcagc 


ccggactcaa 


300 


gtgggtgcca 


gaaactcccg 


tgccaggatc 


agatttgcaa 


gtatgtctcc 


ctaagggccc 


360 


aacatgctgc 


tcaagaaaga 


tggaagaaaa 


ataccaacta 


acagcacgat 


tgaacatgga 


420 


acagctgctt 


cagtctgcaa 


gtatggagct 


caagttctta 


attattcaga 


atgctgcggt 


480 


tttccaagag 


gcctttgaaa 


ttgttgtzcg 


ccatgccaag 


aactacacca 


atgccatgtt 


540 


caagaacaac 


tacccaagcc 


tgactccaca 


agctUttgag 


tttgtgggtg 


aatttttcac 


600 


agatgtgtct 


cUctacatct 


tgggttctga 


catcaatgta 


gatgacatgg 


tcaatgaatt 


660 


gtttgacagc 


ctgtttccag 


tcatctatac 


ccagctaatg 


aacccaggcc 


tgcctgattc 


720 


agccttggac 


atcaatgagt 


gcctccgagg 


agcaagacgt 


gacctgaaag 


tatttgggaa 


780 


tttccccaag 


cttattatga 


cccaggtttc 


cjaagtcactg 


caagtcacta 


ggatcttcct 


840 

* 


tcaggctctg 


aatcttggaa 


ttgaagtgat 


caacacaact 


gatcacctga 


agttcagtaa 


900 


ggactgtggc 


cgaatgctca 


ccagaatgtg 


gtactgctct 


tactgccagg 


gactgatgat 


960 


ggttaaaccc 


tgtggcggtt 


actgcaatgt 


ggtcatgcaa 


ggctgtatgg 


caggtgtggt 


1020 


ggagattgac 


aagtactgga 


gagaatacat 


tctgtccctt 


gaagaacttg 


tgaatggcat 


1080 


gtacagaatc 


tatgacatgg 


agaacgtact 


gcttggtctc 


ttttcaacaa 


tccatgattc 


1140 


tatccagtat 


gtccagaaga 


atgcaggaaa 


gctgaccacc 


actattggca 


agttatgtgc 


1200 


ccattctcaa 


caacgccaat 


atagatctgc 


ttattatcct 


gaagatctct 


ttattgacaa 


1260 


gaaagtatta 


aaagttgctc 


atgtagaaca 


tgaagaaacc 


ttatccagcc 


gaagaaggga 


1320 


actaattcag 


aagttgaagt 


ctttcatcag 


cttctatagt 


gctttgcctg 


gctacatctg 


1380 


cagccatagc 


cctgtggcgg 


aaaacgacac 


cctttgctgg 


aatggacaag 


aactcgtgga 


1440 
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gagatacagc caaaaggcag caaggaatgg 
gaaaatgaag ggccctgagc cagtggtcag 
ccagctcctg agaaccatgt ctatgcccaa 
ggaagggttt gaaagtggag actgcggtga 
tgatggaatg ataaaagtga agaatcagct 
ggatgtggai; gatgcgcctg gaaacagtca 
cacctttcac aacctcggga acgttcattc 
ctcggtggtg tgcttcttct tcctggtgca 
ccctaoagca ccctgtggtc ttcctcgata 
tttttttttt tgttatcctg tatacctcct 
ttatgttttc. gaaaatcaaa tggtatcttt 
tagattgtcc ttttgcaa 



PCT/US02/21694 

aatgaaaaac cagttcaatc tccatgagct 1500 

tcaaattatt gacaaactga agcacattaa 1560 

aggtagagtt ctggataaaa acctggatga 162 0 

tgatgaagat gagtgcattg gaggctctgg 1680 

ccgcttcctt gcagaactgg cctatgatct 1740 

gcaggcaact ccgaaggaca acgagataag 1800 

cccgctgaag cttctcacca gcatggccat 1860 

ctgactgcct ggtgcccagc acatgtgctg 1920 

aagggaacca ctttcttatt tttttctatt 1980 

ccagccatga agtagaggac taaccatgtg 2040 

tggaggaaga tacattttag tggtagcata 2100 

2118 
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